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FIBER SHAFT MODIFICATIONS FOR EFFICIENT TARGETING 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial No. 
60/350,388, filed 24 January 2002, entitled "FIBER SHAFT MODIFICATIONS 
5 FOR EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R., and to U.S. provisional application Serial No. 
60/391,967, filed 26 June 2002, entitled "FIBER SHAFT MODIFICATIONS FOR 
EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R. This application is also related to International 

10 PCT application No. (attorney docket number 22908-1236), filed the same day 
herewith, entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT 
TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, Theodore and 
Nemerow, Glen R. Where permitted, the subject matter of each of these 
applications is incorporated by reference herein. 

15 FIELD OF INVENTION 

The present invention generally relates to the field of adenoviral vectors 
and the production of such vectors. In particular, detargeted adenoviral vectors 
are provided. 
BACKGROUND 

20 Most, if not all, adenoviral vector-mediated gene therapy strategies aim to 

transduce a specific tissue, such as a tumor or an organ. Systemic delivery will 
require ablation of the normal virus tropism as well as addition of new 
specificities. Multiple interactions between adenoviral particles and the host cell 
are required to promote efficient cell entry (Nemerow (2000) Virology 274:1-4). 

25 An adenovirus entry pathway is believed to involve two separate cell surface 
events. First, a high affinity interaction between the adenoviral fiber knob and 
coxsackie-adenovirus receptor (CAR) mediates the attachment of the adenovirus 
particle to the cell surface. A subsequent association of penton with the cell 
surface integrins a v /? 3 and a v ft s , which act as co-receptors, potentiates virus 

30 internalization. There are a plurality of adenoviral fiber receptors, which interact 
with the group B {e.g., Ad3) and group C (e.g., Ad5) adenoviruses. Both of 
these groups of adenoviruses appear to require interaction with integrins for 
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internalization. CAR ablation, however, does not change biodistribution and 
toxicity of adenoviral vectors in vivo (Alemany et aL (2001) Gene Therapy 
5:1347-1353; U.S. patent application No. 09/870,203, filed May 30, 2001, and 
published as U.S. Published application No. 20020137213). Thus, the role of 
5 CAR interaction for in vivo gene transfer is not clear. Recently published studies 
have described conflicting results (Alemany et aL (2001) Gene Therapy 
5:1347-1353; Leissner et aL (2001) Gene Therapy 6:49-57; Einfeld et aL (2001) 
J. Virology 75\\ 1 284-1 1 291 ). For example,it has been shown that vectors 
containing an S408E mutation in the Ad5 fiber AB loop yield efficient liver 
10 transduction in mice, despite having greatly reduced transduction efficiencies on 
cells in culture (see, Leissner et aL (2001) Gene Therapy 5:49-57). In contrast, 
vectors containing a more extensive fiber AB loop mutation showed a 10-fold 
reduction in liver gene expression (see, Einfeld et aL (2001) J. Virology 
75:1 1284-1 1291). 

15 A doubly ablated adenovirus has been prepared by modifying the CAR 

binding region in the fiber loop and the integrin binding region in the penton base 
(Einfeld et aL (2001 ) J. Virology 75:11284-11291). This doubly ablated 
adenovirus, lacking CAR and integrin interactions, was reported not only to lack 
in vitro transduction of various cell types but also to lack in vivo transduction of 

20 liver cells. Specifically, the doubly ablated adenovirus was reported to have a 
700 fold reduction in liver transduction when compared to the non-ablated 
adenovirus. These results, however, were not reproduced by others. 

For many applications, the most clinically useful adenoviral vector would 
be deliverable systemically, such as into a peripheral vein, and would be targeted 

25 to a desired location in the body, and would not have undesirable side effects 

resulting from targeting to other locations. In vivo adenoviral vector targeting is 
a major goal in gene therapy and a significant effort has been focused on 
developing strategies to achieve this goal. Successful targeting strategies would 
direct the entire vector dose to the appropriate site and would be likely to 

30 improve the safety profile of the vector by permitting the use of lower, less toxic 
vector doses, which potentially also can be less immunogenic. Thus, there is a 
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need to develop adenoviruses which are fully detargeted in vivo for use as a 
base vector for producing redirected adenoviruses. 

Therefore, among the objects herein, it is an object herein to provide fully 
detargeted adenoviral vectors, methods for preparation thereof, and uses 
5 thereof. 

SUMMARY 

Detargeted and fully detargeted adenoviral particles, adenovirus vectors 
from which such particles are produced, methods for preparation of the vectors 
and particles and uses of the vectors and particles are provided. Provided and 

10 described are capsid modifications, such as fiber shaft modifications, and the 
resuling proteins that, when expressed on adenoviral particles provide for 
detargeting of adenoviral vectors. The capsid modifications, such as the fiber 
shaft modifications, can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

15 adenoviral particles. Thus, adenoviral vectors and adenovirual particles whose 
native tropisms are ablated through a modification or modifications of capsid 
proteins, particularly a fiber shaft region, are provided. 

Thus, provided are capsid mutiations, including fiber shaft modifications, 
that ablate binding to particular receptors, thereby permitting efficient targeting 

20 of adenoviral vectors that contain capsids with such modifications. For example, 
adenoviral vectors in which the fiber shaft's interaction with HSP is ablated 
(reduced or substantially eliminated), particuarly in vivo, are provided. These 
fiber shaft modifications can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

25 adenoviral vectors. Also provided are retargeted vectors and particles that 

include a ligand or ligands to provide for targeting of the detargeted vectors and 
particles to selected cells and/or tissues. Retargeting can be effected, for 
example, by manipulating the fiber protein to redirect the receptor specificity to a 
particular cell type. 

30 Also provided are nucleic acids encoding the modified fiber proteins and 

also modified penton proteins. Also provided are nucleic acids encoding the 
modified fiber shaft protein that has ablated HSP binding and combinations 
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thereof with other modified fiber regions or other proteins, such as a modified 
fiber knob region and/or the modified penton protein. The nucleic acids also can 
contain heterologous nucleic acid sequences, such as promoters or nucleic acid 
sequences encoding polypeptides. The viral particles that express fibers 
5 containing such shaft modifications and other modifications are also provided. 

Also provided are methods for making and using the adenoviral particles 
that express the modified fibers and combinations of modified fibers and 
modified penton. With the fiber shaft modifications, particularly in combination 
with the fiber knob modifications and the penton modifications, the adenovirus 

10 particles are ablated for binding to their natural cellular receptor(s), i.e., they are 
detargeted. They can then be "retargeted" to a specific cell type through the 
addition of a ligand to the virus capsid, which causes the virus to bind to and 
infect such cell. The ligand can be added, for example, through genetic 
modification of a capsid protein gene. 

15 Also provided is a method for reducing liver toxicity in adeno- 

viral-mediated therapy. In contrast to the results of Einfeld et ah (Einfeld et ah 
(2001) J. Virology 75:1 1284-1 1291), it is shown herein that a doubly ablated 
adenovirus, lacking CAR and integrin interactions, is capable of in vivo liver 
transduction. It is shown herein that ablation of liver transduction requires 

20 further and/or alternative modification(s). The method for reducing liver toxicity 
in adenoviral-mediated therapy includes 

modifying an adenoviral vector to ablate native tropism to liver cells in vivo. 
Such vector can be administered to a subject. The modifications include the 
modifications described herein. 

25 The nucleic acids, proteins, adenoviral particles and adenoviral vectors 

have a variety of uses. These include in vivo and in vitro uses to target nucleic 
acid to particular cells and tissues, for therapeutic purposes, including gene 
therapy, and also for the identification and study of cell surface receptors and 
identification of modes of interaction of viruses with cells. 

30 In particular, adenoviral fiber shaft modifications that ablate viral 

interaction with HSP (Heparin Sulfate Proteoglycans; also referred to as heparin 
sulfate glycosaminoglycans) are provided. These modifications include 
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mutations of individual amino acids in the fiber shaft that interact with HSP or 
mutations of amino acids in the fiber shaft that modify the ability of the HSP 
binding motif to interact with HSP. Adenoviral fiber shaft modifications also 
include replacements of fiber shafts using fiber shafts of adenoviruses, such as, 
5 for example, Ad3, Ad35 and Ad41 short fiber shaft, that do not contain HSP 
binding sites. 

Also provided are adenoviral fiber shaft modifications that alter, 
particularly ablate viral interaction with HSP, as described above, in combination 
with fiber knob modifications that ablate viral interaction with CAR. The fiber 

10 knob modifications include: (a) mutations of individual amino acids in the fiber 
loop that interact with CAR, such as, for example, AB or CD loop modifications; 
(b) mutations of individual amino acids in the fiber loop that modify the ability of 
the CAR binding motif to interact with CAR; and (c) replacements of fiber knobs 
using adenoviruses that do not interact with CAR, such as, for example, Ad3 

15 fiber knob, Ad41 short fiber knob, or Ad35 fiber knob. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with penton modifications that ablate viral interaction with a v 
integrins. The penton modifications include: (a) mutations of individual amino 
acids that interact with or v integrins; (b) mutations of individual amino acids that 

20 modify the ability of the a v integrin binding motif to interact with the a v integrins; 
and (c) replacement of penton proteins using penton proteins from adenoviruses 
that do not interact with the a v integrins. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with fiber knob modifications as described above and penton i 

25 modifications as described above. 

Also provided is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium results in entry of adenoviral 
particles into the producer cells. 
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Provided aer recombinant viral particles that contain a modified capid 
protein whereby binding to heparin sulfate proteoglycans (HSP) is reduced or 
eliminated compared to particles that contain unmodified capsid proteins. The 
modified capsid proteins include fibers proteins with modified shafts such that 
5 binding to HSP is reduced or eliminated. 

Among the particular embodiments the following are provided. Provided 
are adenovirus capsid proteins that are modified to alter, typically reduce or 
eliminate, binding to or interaction with in vivo and/or in vitro to heparin sulfate 
proteoglycan (HSP). HSPs are expressed on various cells, including 
10 hepatocytes. It is shown herein that HSPs provide for or participate in 

transduction of cells, such as liver cells. Since it can be desirable to eliminate or 
reduce such transduction, the modifications of the capsid proteins, such as fiber 
proteins, permit detargeting of particles that express such proteins from such 
cells. 

1 5 Thus provided are modified adenovirus fiber proteins that include a 

mutation, such as an insertion, deletion, change, replacement of amino acids or 
combinations thereof, whereby binding to or interaction with heparin sulfate 
proteoglycan (HSP) is altered. In particular, the he binding of the modified fiber 
protein is eliminated or reduced compared to the unmodified protein. Exemplary 

20 of these mutations are mutations in the shaft of a fiber, where the shaft also can 
include the tail. The mutations can reduce or alter the affinity of the fiber 
protein for HSP is reduced at least by 2-fold, 5-fold, 10-fold, 100-fold or more, 
including substantially eliminating it. 

As provided herein, fibers from adenoviruses that interact with HSP can 

25 include a motif, such as BBXB or BBBXXB, wherethe B is a basic amino acid and 
X is any amino acid, particularly the consensus sequence KKTK in Ad5 and Ad2. 
Thus, provided are fibers in which the motif is altered to eliminate or reduce 
interaction with HSP. 

Also provided are modified fiber protein of claim 1 that are chimeras in 

30 which the fiber shaft (or fiber shaft and tail) are derived from a fiber, such as 
Ad3, Ad35, Ad7, Ad1 1, Ad16, Ad21, Ad34, Ad40, Ad41 or Ad46 fiber, that 
does not interact with HSP and combined with fiber that does interact, such as 
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Ad5 or Ad2 fiber, to produce a complete fiber whose binding to HSP is reduced 
or eliminated. 

All of the modified capsids proteins provided herein also can include one 
or more further modifications that reduce or eliminate interaction of the resulting 
fiber with one or more cell surface proteins, such as but not limited to, CAR and 
cr v integrin or other receptor to which a particular native fiber binds, in addition to 
HSP. These modifications include, but are not limited to, modification to fiber 
that reduces or eliminates CAR binding and modification to penton that reduces 
or eliminate a v integrin binding. The mutations can be in the fiber knob, shaft, 
tail and shaft, and also in penton. 

Any and all of the modified capsid proteins provided herein can further 
include a ligand that binds to a particular receptor thereby endowing a fiber (or 
other capsid protein) with binding specificity or the ability to interact with such 
receptor. The ligand can be inserted into any suitable site in a capsid protein, 
such as an insertion or replacement. For example, fibers with ligands inserted 
into the knob region are exemplified. Any such ligand can be employed and a 
variety are exemplified herein. 

A variety of modified capsid proteins are exemplified herein. These 
include, but are not limited to, fibers containing: the sequence of amino acids 
set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a 
sequence of amino acids having 60%, 70%, 80%, 90%, 95% or greater 
sequence identity with a sequence of amino acids set forth in any of SEQ ID 
Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a sequence of amino acids encoded 
by a sequence of nucleotides that hybridizes under conditions of high stringency 
along at least 70% of its length to a sequence of nucleotides that encodes a 
sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
70 and 72. 

Nucleic acids encoding the capsid proteins, including the fibers are also 
provided. The nucleic acids can be provided as vectors, particularly as 
adenovirus vectors. Many adenoviral vectors are known and can be modified as 
needed in accord with the description herein. Adenoviral vectors include, but 
are not limited to, early generation adenoviral vectors, such as E1 -deleted 
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vectors, gutless adenoviral vectors and replication-conditional adenoviral vectors, 
such as oncolytic adenoviral vectors. The adenovirus vectors also can include 
heterologous nucleic acids that encode or provide products, such as therapeutic 
products. Any therapeutic product is contemplated and a variety are set forth 
5 herein as exemplary. Heterologous nucleic acid can encode a polypeptide or 
comprise or encode a regulatory sequence, such as a promoter or an RNA, 
including RNAi, small RNAs, other double-stranded RNAs, antisense RNA, and 
ribozymes. Promoters include, for example, constitutive and regulated 
promoters and tissue specific promoter, including tumor specific promoters. 
10 The promoter can be operably linked, for example, to a gene of an adenovirus 
essential for replication. 

Cells containing the nucleic acid molecules and cells containing the 
vectors are also provided. Such cell include packaging cells. The cells can be 
prokaryotic or eukaryotic cells, including, mammalian cells, such a primate cells, 
* 15 including human cells. 

Also provided are adenoviral particles that contain the modified capsid 
proteins provided herein. The particles have altered interaction or binding with 
HSP compared to particles that do not contain the modified capsid proteins. In 
addition to altered binding to HSP, which is typically reduced or eliminated 
20 binding, the particles can include further modifications, such as capsid proteins 
with altered interaction with other receptors as described above. In particular, 
the particles can have altered, typically reduced or eliminated, interaction with 
CAR, a v integrin and/or other receptors. The mutation include mutations in the 
fiber knob, penton and hexon. Exemplary fiber know mutations are mutations in 
25 the AB loop or CD loop, such as KOI or K012, which are described herein. In 
addition, the particles can include additional ligands for retargeting to selected 
receptors. The adenoviral particles can be from any serotype and subgroup. 

Methods for expressing heterologous nucleic acids in a cell are provided. 
In these methods an adenoviral vector provided herein is transduced into a cell 
30 to deliver the nucleic acid and/or encoded products. Transduction can be 
effected in vivo or in vitro or ex vivo, and can be for a variety of purposes 
including study of gene expression and genetic therapy. The cells can be 
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prokaryotic cells, but typically are eukaryotic cells, including mammalian cells, 
such as primate, including human, cells. The cells can be of a specific type, such 
as a tumor cell or a cell in a particular tissue. The vectors can be oncolytic 
vector to effect killing of tumor cells. 
5 Since the modified capsid proteins herein have reduced or eliminated 

binding to HSP, viral particles containing such proteins exhibit ablated binding to 
HSP in vitro and in vivo. Thus provided is a method of reducing transduction of 
cells that express HSP, such as hepatocytes in the liver, by modifying a capsid 
protein, such as fiber to eliminate or reduce interaction with or binding to HSP. 

10 Such reduction reduces or eliminates transduction of cells that express HSP, 
including liver cells. 

Also provided are scale-up methods for the propagation of detargeted 
adenoviral particle, such as those provided herein. The method includes the 
steps of infecting or transducing a cell capable of replicating, maturing and 

15 packaging an adenoviral vector with a detargeted adenoviral vector in the 

presence of a reagent that results in entry of the adenoviral particle into the cell, 
such as a polycation and/or a Afunctional protein or other such reagent; and 
culturing the infected cell under conditions suitable for growth, spread and 
propagation of the adenoviral vector. The resulting adenoviral particles can be 

20 recovered. Polycations include, but are not limited to, hexadimethrine bromide, 
polyethylenimine, protamine sulfate and poly-L-lysine. Bifunctional proteins, 
include, but are not limited to, an anti-fiber antibody ligand fusion, an 
anti-fiber-Fab-FGF conjugate, an anti-penton-antibody ligand fusion, an 
anti-hexon antibody ligand fusion and a polylysine-peptide fusion. The ligand is 

25 selected to bind to a particular receptor. 

The viral particles that express a modified capsids provided herein can be 
produced by this method. The modification include, for example, one or more 
mutations selected from among mutations that reduce or eliminate interactions 
with one or more of a v integrins, coxsackie-adenovirus receptors (CAR) and 

30 heparin sulfate proteoglycans (HSP). Such mutations include, for example, PD1, 
KOI, K012 and S*. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a plasmid map for pSKOI. 

Figure 2 is a plasmid map for pNDSQ3.1 KOI . 

Figures 3A-3C are plasmid maps of pAdmireRSVnBg.(Fig. 3A), pSQ1 Fig. 
5 3B) and pSQ1K012 (Fig. 3C) 

Figure 4 is a plasmid map for pSQIPDI. 

Figures 5A-5B are plasmid maps of pSQ1FKQ1PD1 (Fig. 5A) and 
pSQ1 KOI 2PD1 (Fig. 5B). 

Figure 6 shows in vitro transduction efficiency of A549 cells using 
10 adenoviral vectors containing fiber AB loop knob and/or penton, PD1 mutations. 
The following adenoviral vectors were used in these studies: AvInBg, 
Av1 nBgFKO! , referred to as FKOI, Av1nBgPD1, referred to as PQ1 # and 
Av1 nBgFKO! PD1 that is referred to as FK01PD1. 

Figure 7A-7B shows in vivo adenoviral-mediated liver gene expression 
15 (Fig. 7A) and hexon DNA content (Fig. 7B) using adenoviral vectors containing 
fiber AB loop knob and/or penton, PD1 mutations. The following adenoviral 
vectors were used in these studies: AvInBg, Av1 nBgFK0 1, referred to as 
FKOI, Av1nBgPD1, referred to as PD1 , AvInBgFKOI PD1 , referred to as 
FK01PD1, Av1nBgKO!2, referred to as K012, and Av1 nBgKOI 2PD1 that is 
20 referred to as KOI 2PD1 . 

Figure 8 is a plasmid map for pFBshuttle(EcoRI). 
Figure 9 is a plasmid map for pSQ1 HSP. 
Figure 10 is a plasmid map for pSQIHSPKOI. 
Figure 1 1 is a plasmid map for pSQ1HSPPD1 . 
25 Figure 1 2 is a plasmid map for pSQ1 HSPKOI PD1 . 

Figures 13A-13C show the transduction efficiency of A549 and HeLa 
cells using adenoviral vectors containing fiber shaft, knob and/or penton 
mutations. Fig. 13A shows the dose response for the transduction efficiency of 
A549 cells. Fig. 13B shows the transduction efficiency of HeLa cells at 2000 
30 ppc. Figure 1 3C shows the competition analysis of adenoviral vectors 
containing fiber shaft mutations. 
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Figures 14A-14B shows the influence of fiber shaft mutations oh in vivo 
adenoviral-mediated liver gene expression (Fig. 14A) and hexon DNA content 
(Fig. 14B). 

Figures 15A-15B are plasmid maps of pSQIHSPRGD (Fig. 15A) and 
5 pSQIHSPKOIRGD (Fig. 15B). 

Figure 16 shows that insertion of a RGD targeting ligand can restore 
transduction of the vectors containing the HSP binding shaft S* mutation. 

Figures 17A-17B are plasmid maps of pSQ1 AD35Fiber (Fig. 17A) and 
pSQ1 Ad35FcRGD (Fig. 17B). 
10 Figures 18A-18B are maps of plasmids encoding 35F chimeric fibers. Fig. 

18A is a plasmid map of pSQ135T5H, and Fig. 18B is a plasmid map of 
pSQ1 5T35H. 

Figure 19 shows the results of an in vitro analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 
15 Figure 20 shows the results of an in vivo analysis of Ad5 vectors 

containing Ad35 fibers and derivatives thereof. 

Figures 21A-21B are plasmid maps of pSQ1Ad41sF (Fig. 21 A) and 
pSQ1 Ad41 sFRGD (Fig. 21 B). 

Figure 22 shows the results of an in vivo analysis of Ad5 vectors 
20 containing Ad41 short fiber. 

Figure 23 shows the in vitro analysis of Ad5 based vectors containing the 
Ad41 short fiber which has been re-engineered to contain a cRGD ligand in the 
HI loop. 

Figure 24 shows enhanced transduction of AE1-2a cells with the 
25 Av3nBgFK01 detargeted adenoviral vector using hexadimethrine bromide (HB), 
protamine sulfate (PS) and poly-lysine-RGD (K14) or the anti-penton-TNFa 
Afunctional protein (apen-TNF). 

Figure 25 shows ablation of HSP interaction decreases 
adenoviral-mediated gene transfer to other organs 
30 Figure 26 shows in vivo liver transduction with adenoviral vectors which 

encode for B-galactosidase and contain various mutations to the fiber and/or 
penton proteins. Results are plotted as percent transduction as compared to 
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wild type. Two different methods for determining the level of transduction are 
shown for each vector. 

Figure 27 shows the adenoviral vector biodistribution to the liver and 
tumor for the vectors containing the S*, KOIS*, and 41 sF fibers. 
5 DETAILED DESCRIPTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
which the invention(s) belong. All patents, patent applications, published 
10 applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 
otherwise, are incorporated by reference in their entirety. In the event that there 
, i are a plurality of definitions for terms herein, those in this section prevail. 

Where reference is made to a URL or other such identifier or address, it 
15 understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information is known and can be 
i readily accessed, such as by searching the internet and/or appropriate 

databases. Reference thereto evidences the availability and public dissemination 
of such information. 

20 As used herein, the term "adenovirus" or "adenoviral particle" is used to 

include any and all viruses that can be categorized as an adenovirus, including 
any adenovirus that infects a human or an animal, including all groups, 
subgroups, and serotypes. Depending upon the context reference to 
"adenovirus" can include adenoviral vectors. There are at least 51 serotypes of 

25 Adenovirus that classified into several subgroups. For example, subgroup A 

includes adenovirus serotypes 12, 18, and 31. Subgroup C includes adenovirus 
serotypes 1, 2, 5, and 6. Subgroup D includes adenovirus serotype 8, 9, 10, 13, 
15, 17, 19, 20, 22-30, 32, 33, 36-39, and 42-49. Subgroup E includes 
adenovirus serotype 4. Subgroup F includes adenovirus serotypes 40 and 41 . 

30 These latter two serotypes have a long and a short fiber protein. Thus, as 
used herein ah adenovirus or adenovirus particle is a packaged vector or 
genome. 
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As used herein, "virus," "viral particle," "vector particle," "viral vector 
particle," and "virion" are used interchangeably to refer to infectious viral 
particles that are formed when, such as when a vector containing all or a part of 
a viral genome, is transduced into an appropriate cell or cell line for the 
5 generation of such particles. The resulting viral particles have a variety of uses, 
including, but not limited to, transferring nucleic acids into cells either in vitro or 
in vivo. For purposes herein, the viruses are adenoviruses, including recombinant 
adenoviruses formed when an adenovirus vector, such as any provided herein, is 
encapsulated in an adenovirus capsid. Thus, a viral particle is a packaged viral 

10 genome. An adenovirus viral particle is the minimal structural or functional unit 
of a virus. A virus can refer to a single particle, a stock of particles or a viral 
genome. The adenovirus (Ad) particle is relatively complex and may be resolved 
into various substructures. 

Included among adenoviruses and adenoviral particles are any and all 

15 viruses that can be categorized as an adenovirus, including any adenovirus that 
infects a human or an animal, including all groups, subgroups, and serotypes. 
Thus, as used herein, "adenovirus" and "adenovirus particle" refer to the virus 
itself and derivatives thereof and cover all serotypes and subtypes and naturally 
occurring and recombinant forms, except where indicated otherwise. Included 

20 are adenoviruses that infect human cells. Adenoviruses can be wildtype or can 
be modified in various ways known in the art or as disclosed herein. Such 
modifications include, but are not limited to, modifications to the adenovirus 
genome that is packaged in the particle in order to make an infectious virus. 
Exemplary modifications include deletions known in the art, such as deletions in 

25 one or more of the El a, E1b, E2a, E2b, E3, or E4 coding regions. Other 

exemplary modifications include deletions of all of the coding regions of the 
adenoviral genome. Such adenoviruses are known as "gutless" adenoviruses. 
The terms also include replication-conditional adenoviruses, which are viruses 
that preferentially replicate in certain types of cells or tissues but to a lesser 

30 degree or not at all in other types. For example, among the adenoviral particles 
provided herein, are adenoviral particles that replicate in abnormally proliferating 
tissue, such as solid tumors and other neoplasms. These include the viruses 
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disclosed in U.S. Patent No. 5,998,205 and U.S. Patent No. 5,801,029. Such 
viruses are sometimes referred to as "cytolytic" or "cytopathic" viruses (or 
vectors), and, if they have such an effect on neoplastic cells, are referred to as 
"oncolytic" viruses (or vectors). 

As used herein, the terms "vector," "polynucleotide vector," 
"polynucleotide vector construct," "nucleic acid vector construct," and "vector 
construct" are used interchangeably herein to mean any nucleic acid construct 
that can be used for gene transfer, as understood by those skilled in the art. 

As used herein, the term "viral vector" is used according to its 
art-recognized meaning. It refers to a nucleic acid vector construct that includes 
at least one element of viral origin and can be packaged into a viral vector 
particle. The viral vector particles can be used for the purpose of transferring 
DNA, RNA or other nucleic acids into cells either in vitro or in vivo. Viral vectors 
include, but are not limited to, retroviral vectors, vaccinia vectors, lentiviral 
vectors, herpes virus vectors (e.g., HSV), baculoviral vectors, cytomegalovirus 
(CMV) vectors, papillomavirus vectors, simian virus (SV40) vectors, Sindbis 
vectors, semliki forest virus vectors, phage vectors, adenoviral vectors, and 
adeno-associated viral (AAV) vectors. Suitable viral vectors are described, for 
example, in U.S. Patent Nos. 6,057,155, 5,543,328 and 5,756,086. The 
vectors provided herein are adenoviral vectors. 

As used herein, "adenovirus vector" and "adenoviral vector" are used 
interchangeably and are well understood in the art to mean a polynucleotide 
containing all or a portion of an adenovirus genome. An adenoviral vector, refers 
to nucleii; encoding a complete genome or a modified genome or one that can be 
used to introduce heterologous nucleic acid when transferred into a cell, 
particularly when packaged as a particle. An adenoviral vector can be in any of 
several forms, including, but not limited to, naked DNA, DNA encapsulated in an 
adenovirus capsid, DNA packaged in another viral or viral-like form (such as 
herpes simplex, and AAV), DNA encapsulated in liposomes, DNA complexed 
with polylysine, complexed with synthetic polycationic molecules, conjugated 
with transferrin, complexed with compounds such as PEG to immunologically 
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"mask" the molecule and/or increase half-life, or conjugated to a non-viral 
protein. 

As used herein, oncolytic adenoviruses refer to adenoviruses that 
replicate selectively in tumor cells 
5 As used herein, a variety of vectors with different requirements and 

purposes are described. For example, one vector is used to deliver particular 
nucleic acid molecules into a packaging cell line for stable integration into a 
chromosome. These types of vectors also are referred to as complementing 
plasmids. A further type of vector carries or delivers nucleic acid molecules in or 
10 into a cell line {e.g., a packaging cell line) for the purpose of propagating viral 

vectors; hence, these vectors also can be referred to herein as delivery plasmids. 
A third "type" of vector is the vector that is in the form of a virus particle 
encapsulating a viral nucleic acid and that is comprised of the capsid modified as 
provided herein. Such vectors also can contain heterologous nucleic acid 
15 molecules encoding particular polypeptides, such as therapeutic polypeptides or 
regulatory proteins or regulatory sequences to specific cells or cell types in a 
subject in need of treatment. 

As used herein, the term "motif" is used to refer to any set of amino 
acids forming part of a primary sequence of a protein, either contiguous or 
20 capable of being aligned to certain positions that are invariant or conserved, that 
is associated with a particular function. The motif can occur, not only by virture 
of the primary sequence, but also as a consequence of three-dimensional folding. 
For example, the motif GXGXXG is associated with nucleotide-binding sites. In 
this fiber is a trimer, hence the trimeric structure can contribute fornation of a 
25 motif. Alternatively, a motif can be considered as a domain of a protein, where 
domain is a region of a protein molecule delimited on the basis of function 
without knowledge of and relation to the molecular substructure, as, e.g., the 
part of a protein molecule that binds to a receptor. As shown herein, the motif 
KKTK constitutes a consensus sequence for fiber shaft interaction with HSP. 
30 As used herein, the term "bind" or "binding" is used to refer to the 

binding between a ligand and its receptor, such as the binding of an Ad5 shaft 
motif with HSP (Heparin Sulfate Proteoglycans), with a K d in the range of 10-2 
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to 10-15 mole/l, generally, 10' 6 to 10" 15 , 10' 7 to 10" 15 and typically 10 8 to 10" 15 
(and/or a K a of 10 5 -10 12 , 10 7 -10 12 , 10 8 -10 12 l/mole). 

As used herein, specific binding or selective binding means that a the 
binding of a particular ligand and one receptor interaction (k a or K eq ) is at least 2- 
5 fold, generally, 5, 10, 50, 100 or more-fold, greater than for another receptor. 
A statement that a particular viral vector is targeted to a cell or tissue means 
that its affinity for such cell or tissue in a host or in vitro is at least about 2-fold, 
generally, 5, 10, 50, 100 or more-fold, greater than for other cells and tissues in 
the host or under the in vitro conditions. 
10 As used herein, the term "ablate" or "ablated" is used to refer to an 

adenovirus, adenoviral vector or adenoviral particle, in which the ability to bind 
to a particular cellular receptor is reduced or eliminated, generally substantially 
eliminated (i.e., reduced more than 10-fold, 100-fold or more) when compared to 
a coresponding wild-type adenovirus. An ablated adenovirus, adenoviral vector 
15 or adenoviral particle also is said to be detargeted, i.e., the modified adenovirus, 
adenoviral vector or adenoviral particle does not possess the native tropism of 
the wild-type adenovirus. The reduction or elimination of the ability of the 
mutated adenovirus fiber protein and/or mutated adenovirus penton protein to 
bind a cellular receptor as compared to the corresponding wild-type fiber protein 
20 and/or wild-type penton protein can be measured or assessed by comparing the 
transduction efficiency (gene transfer and expression of a marker gene) of an 
adenovirus particle containing the mutated fiber protein and/or mutated penton 
protein compared to an adenovirus particle containing the wild-type fiber protein 
and/or wild-iype penton protein for cells having the cellular receptor. 
25 As used herein, tropism with reference to an adenovirus refers to the 

selective infectivity or binding that is conferred on the particle by a capsid 
protein, such as the fiber protein and/or penton. 

As used herein, "penton" or "penton complex" is used herein to designate 
a complex of penton base and fiber. The term "penton" can also be used to 
30 indicate penton base, as well as penton complex. The meaning of the term 
"penton" alone should be clear from the context within which it is used. 
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As used herein, the term "substantially eliminated" refers to a 
transduction efficiency less than about 1 1 % of the efficiency of the wild-type 
fiber containing virus on HeLa cells. The transduction efficiency on Hela cells 
can be measured (see, e.g., Example 1 of U.S. Patent Application Serial No. 
5 09/870,203 filed on 30 May 2001, and published as U.S. Published application 
No. 20020137213, and of International Patent Application No. PCT/EP01 /06286 
filed 1 June 2001 ). Briefly, HeLa cells are infected with the adenoviral vectors 
containing mutated fiber proteins to evaluate the effects of fiber amino acid 
mutations on CAR interaction and subsequent gene expression. Monolayers of 

10 HeLa cells in 12 well dishes are infected with, for example, 1000 particles per 
cell for 2 hours at 37° C. in a total volume of, for example, 0.35 ml of the 
DMEM containing 2% FBS. The infection medium is then aspirated from the 
monolayers and I ml of complete DMEM containing 10% FBS was added per 
well. The cells are incubated for an sufficient time, generally about 24 hours, to 

15 allow for £-galactosidase expression, which is measured by a chemiluminescence 
reporter assay and by histochemical staining with a chromogenic substrate. The 
relative levels of /?-galactosidase activity are determined using as suitable 
system, such as the Galacto-Light chemiluminescence reporter assay system 
(Tropix, Bedford, Mass.) Cell monolayers are washed with PBS and processed 

20 according to the manufacturer's protocol. The cell homogenate is transferred to 
a microfuge tube and centrifuged to remove cellular debris. Total protein 
concentration is determined, such as by using the bicinchoninic acid(BCA) 
protein assay (Pierce, Inc., Rockford, III.) with bovine serum albumin as the 
assay standard. An aliquot of each sample is then incubated with the Tropix 

25 j?-galactosidase substrate for 45 minutes in a 96 well plate. A luminometer is 
used determine the relative light units (RLU) emitted per sample and then 
normalized for the amount of total protein in each sample (RLU/ug total protein). 
For the histochemical staining procedure, the cell monolayers are fixed with 
0.5% glutaraldehyde in PBS, and then were incubated with a mixture of 1 mg of 

30 5-bromo-4-chloro-3-indolyl-/?-D-galactoside (X-gal) per ml, 5 mM potassium 

ferrocyanide, 5 mM potassium ferricyanide and 2 mM MgCI 2 in 0.5 ml of PBS. 
The monolayers are washed with PBS and the blue cells are visualized by light 
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microscopy, such as with a Zeiss ID03 microscope. Generally, the efficiency is 
less than about 9%, and typically is less than about 8%. 

As used herein, the phrase "reduce" or "reduction" refers to a change in 
the efficiency of transduction by the adenovirus containing the mutated fiber as 
compared to the adenovirus containing the wild-type fiber to a level of about 
75% or less of the wild-type on HeLa cells. Generally, the change in efficiency 
is to a level of about 65% or less than wild-type. Typically it is about 55% or 
less. This system is able to rapidly analyze modified fiber proteins and/or 
modified penton proteins for desired tropism in the context of the viral particle. 

As used herein, the term "mutate" or "mutation" or similar terms refers to 
the deletion, insertion or change of at least one amino acid in the part of the 
fiber shaft region interacting with HSP. The amino acid can be changed by 
substitution or by modification in a way that derivatizes the amino acid. Thus, 
for example a BBXB motif or BBBXXB motif, where B is a basic amino acid, in an 
adenovirus is mutated to ablate the viral interaction with HSP. 

As used herein, the term "polynucleotide" means a nucleic acid molecule, 
such as DNA or RNA, that encodes a polynucleotide. The molecule can include 
regulatory sequences, and is generally DNA. Such polynucleotides are prepared 
or obtained by techniques known by those skilled in the art in combination with 
the teachings contained therein. 

As used herein, adenoviral genome is intended to include any adenoviral 
vector or any nucleic acid sequence comprising a modified fiber protein. All 
adenovirus serotypes are contemplated for use in the vectors and methods 
herein. 

As used herein, the term "viral vector" is used according to its art- 
recognized meaning. It refers to a nucleic acid vector construct that includes at 
least one element of viral origin and can be packaged into a viral vector particle. 
The viral vector particles can be used, for example, for transferring DNA into 
cells either in vitro or in vivo. 

As used herein, a packaging cell line is a cell line that is able to package 
adenoviral genomes or modified genomes to produce viral particles. It can 
provide a missing gene product or its equivalent. Thus, packaging cells can 
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provide complementing functions for the genes deleted in an adenoviral genome 
(e.g., the nucleic acids encoding modified fiber proteins) and are able to package 
the adenoviral genomes into the adenovirus particle. The production of such 
particles require that the genome be replicated and that those proteins necessary 
5 for assembling an infectious virus are produced. The particles also can require 
certain proteins necessary for the maturation of the viral particle. Such proteins 
can be provided by the vector or by the packaging cell. 

As used herein, detargeted adenoviral particles have ablated (reduced or 
eliminated) interaction with receptors with which native particles. Fully 

10 detargeted particles have two or more specificities altered. It is understood that 
in vivo no particles are fully ablated such that they do not interact with any cells. 
Degareted and fully degarated have reduced, typically substantial! reduced, or 
eliminated interaction with native receptors. For purposes herein, detargeted 
particles have reduced (2-fold, 5-fold, 10-fold, 100-fold or more) binding or 

15 virtually no binding to HSP receptors; fully degareted vectors include further 

capsid modifications to eliminate interactions with other receptors, such as CAR 
and integrins or other receptors. The particles still bind to cells, but the types of 
cells and interactions are reduced. 

As used herein, pseudotyping describes the production of adenoviral 

20 vectors having modified capsid protein or capsid proteins from a different 

serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 or Ad35 fiber protein. 
This can be accomplished by producing the adenoviral vector in packaging cell 
lines expressing different fiber proteins. As provided herein, detargeting of an 

25 adenovirus 5 particle or other serotype group C adenovirus or other adenovirus 
that binds to HSP to reduce or eliminate binding to HSPs can be effected by 
replacing all or a portion that includes the shaft or at least the HSP consensus 
binding sequence of the Ad5 fiber with an adenovirus fiber or portion thereof 
that does not bind to HSP. Adenoviruses having fiber shafts that do not interact 

30 with HSP include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Ad1 1, 
Ad16, Ad21, Ad34 which do not have interaction with HSP, (b) adenoviruses of 
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subgroup F, e.g., Ad40 and Ad41 , specifically the short fiber, and (c) 
adenoviruses of subgroup D, e.g., Ad46. 

As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
5 protein" can be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

As used herein, the term "cyclic RGD" (or cRGD) refers to any amino acid 
that binds to a v integrins on the surface of cells and contains the sequence RGD 
(Arg-Gly-Asp). 

10 As used herein, the term "heterologous polynucleotide" means a 

polynucleotide derived from a biological source other than an adenovirus or from 
an adenovirus of a different strain or can be a polynucleotide that is in a different 
locus from wild-type virus. The heterologous polynucleotide can encode a 
polypeptide, such as a toxin or a therapeutic protein. The heterologous 

15 polynucleotide can contain regulatory regions, such as a promoter regions, such 
as a promoter active in specific cells or tissue, for example, tumor tissue as 
found in oncolytic adenoviruses. Alternatively, the heterologous polynucleotide 
can encode a polypeptide and further contain a promoter region operably linked 
to the coding region. 

20 As used herein, reference to an amino acid in an adenovirus protein or to 

a nucleotide in an adenovirus genome is with reference to Ad5, unless specified 
otherwise. Corresponding amino acids and nucleotides in other adenovirus 
strains and modified strains and in vectors can be identified by those of skill in 
the art. Thus recitation of a mutation is intended to encompass all adenovirus 

25 strains that process a corresponding locus. 

As used herein, the KO mutations refer to mutations in fiber that knock 
out binding to CAR. For example, a KOI mutation refers to a mutation in the 
Ad5 fiber and corresponding mutations in other fiber proteins. In Ad5, this 
mutation results in a substitution of fiber amino acids 408 and 409, changing 
30 them from serine and proline to glutamic acid and alanine, respectively. As used 
herein, a K0 12 mutation refers to a mutation in the Ad5 fiber and corresponding 
mutations in other fiber proteins. In Ad5, this mutation is a four amino acid 
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substitution as follows: R512S, A515G, E516G, and K517G. Other KO 
mutations can be identified empirically or are known to those of skill in the art. 

As used herein, PD mutations refer to mutations in the penton gene that 
ablate binding by the encoded to or v integrin by replacing the RGD tripeptide. 
5 The PD1 mutation exemplified herein results in a substitution of amino acids 337 
through 344 of the Ad5 penton protein, HAIRGDTF (SEQ ID No. 9), with amino 
acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thereby replacing the RGD 
tripeptide. 

As used herein, treatment means any manner in which the symptoms of a 
10 condition, disorder or disease are ameliorated or otherwise beneficially altered. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

15 As used herein, a subject is an animal, such as a mammal, typically a 

i 

human, including patients. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 

20 into the selected target cells in a manner such that the heterologous DNA is 
expressed and a therapeutic product encoded thereby is produced, 
Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 

25 of a therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 

30 receptor therefor, that Is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
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prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, a therapeutic nuucleic acid is a nucleic acid that endes a 
therapeutic product. The product can be nucleic acid, such as a regulatory 
sequence or gene, or can encode a protein that has a therapeutic activity or 
effect. For example, therapeutic nucleic acid can be a ribozyme, antisense, 
double-stranded RNA, a nucleic acid encoding a protein and others. 

As used herein, "homologous" means about greater than 25% nucleic 
acid sequence identity, such as 25% 40%, 60%, 70%, 80%, 90% or 95%. If 
necessary the percentage homology will be specified. The terms "homology" 
and "identity" are often used interchangeably. In general, sequences are aligned 
so that the highest order match is obtained (see, e.g.: Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed.. Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Parti, Griffin, 
A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991; Carillo et al. (1988) SI AM J Applied Math 43:1073). 
By sequence identity, the number of conserved amino acids are determined by 
standard alignment algorithms programs, and are used with default gap penalties 
established by each supplier. Substantially homologous nucleic acid molecules 
would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid or along at least about 70%, 80% or 90% of the 
full-length nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 

nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
"identical" can be determined using known computer algorithms such as the 
"FAST A" program, using for example, the default parameters as in Pearson et 
al. (1988) Proc. Natl. Acad. Sci. USA 85:2444 (other programs include the GCG 
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program package (Devereux, J., et aL, Nucleic Acids Research 72(0:387 
(1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., et al., J Molec Biol 275:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San 
Diego, 1994, and Carillo et al. (1988) SI AM J Applied Math 45:1073). For 
example, the BLAST function of the National Center for Biotechnology 
Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "MegAlign" program (Madison, Wl) 
and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 
program (Madison WI)). Percent homology or identity of proteins and/or nucleic 
acid molecules can be determined, for example, by comparing sequence 
information using a GAP computer program {e.g., Needleman et al. (1 970) J. 
Mol. Biol. 48:443, as revised by Smith and Waterman ((1981) Adv. Appl. Math. 
2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 
of Gribskov et al. (1986) Nucl. Acids Res. 14:6745, as described by Schwartz 
and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 
identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 1 00 amino acids are 
compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 
can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 



WO 03/062400 



PCT/US03/02295 



-24- 

locations of varying length up to the maximum allowable, e.g. 10/100 amino 
acid difference (approximately 90% identity). Differences are defined as nucleic 
acid or amino acid substitutions, or deletions. At the level of homologies or 
identities above about 85-90%, the result should be independent of the program 
5 and gap parameters set; such high levels of identity can be assessed readily, 
often without relying on software. 

As used herein: stringency of hybridization in determining percentage 
mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65 °C 
10 2) medium stringency: 0.2 x SSPE, 0.1 % SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 
Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g., Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring 
5 Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 M NaCI. Further, those of skill in the art recognize that the 
stability of hybrids is determined by T m , which is a function of the sodium ion 
concentration and temperature (T m = 81 .5° C-1 6.6(log 10 [Na + ]) + 0.41 (%G + C)- 
20 600/D), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 
25 Shilo and Weinberg, Proc. Natl. Acad. Sci. USA 73:6789-6792 (1981)): Filters 
containing DNA are pretreated for 6 hours at 40 °C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1% PVP, 0.1% 
Ficoll, 1% BSA, and 500 /vg/ml denatured salmon sperm DNA (10X SSC is 1.5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 
30 Hybridizations are carried out in the same solution with the following 

modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100//g/ml salmon sperm 
DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 s cpm 32 P-labeled probe is 
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used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1 .5 hours at 55 °C in a solution containing 2X SSC, 25 mM 
Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1 % SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1.5 hours at 60 °C. Filters are 
5 blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68 °C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art {e.g., as employed for 
cross-species hybridizations). 

By way of example and not way of limitation, procedures using 

10 conditions of moderate stringency include, for example, but are not limited to, 
procedures using such conditions of moderate stringency are as follows: Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
SSC, 5X Denhart's solution, 0.5% SDS and 100/yg/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 

15 32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55 °C, and then washed twice for 30 minutes at 60 °C in a solution 
containing IX SSC and 0.1% SDS. Filters are blotted dry and exposed for 
autoradiography. Other conditions of moderate stringency which can be used 
are well-known in the art. Washing of filters is done at 37 °C for 1 hour in a 

20 solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 
50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 

25 and 500 //g/mi denatured salmon sperm DNA. Filters are hybridized for 48 hours 
at 65 °C in prehybridization mixture containing 100 jug/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01 % PVP, 0.01% Ficoll, 
and 0.01 % BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 

30 minutes before autoradiography. Other conditions of high stringency which can 
be used are well known in the art. 
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The term substantially identical or substantially homologous or similar 
varies with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 85% or 
more preferably at least 90%, and most preferably at least 95% identity. 
5 As used herein, substantially identical to a product means sufficiently 

similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
appear free of readily detectable impurities as determined by standard methods 
10 of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 

high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
not detectably alter the physical and chemical properties, such as enzymatic and 
biological activities, of the substance. Methods for purification of the 
15 compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound can, 
however, be a mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

The methods and and preparation of products provided herein, unless 
20 otherwise indicated, employ conventional techniques of chemistry, molecular 

biology, microbiology, recombinant DNA, genetics, immunology, cell biology, cell 
culture and transgenic biology, which are within the skill of the art {see, e.g., 
Maniatis et al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY); Sambrook et al. (198C) 
25 Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY; Ausubel et al. (1992) Current Protocols in Molecular 
Biology, Wiley and Sons, New York; Glover (1985) DNA Cloning I and II, Oxford 
Press; Anand (1992) Techniques for the Analysis of Complex Genomes 
(Academic Press); Guthrie and Fink (1991) Guide to Yeast Genetics and 
30 Molecular Biology, Academic Press; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harobor, NY; 
Jakoby and Pastan f eds. (1979) Cell Culture. Methods in Enzymology 58, 
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Academic Press, Inc., Harcourt Brace Jaovanovich, NY; Nucleic Acid 
Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells 
(R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal (1984), A Practical Guide To Molecular Cloning; Gene 
5 Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, 
Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 
(Wu et al. eds.); Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental 
Immunology, Volumes MV (D. M. Weir and C. C. Blackwell, eds., 1986); Hogan 

10 et al. (1986) Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. 
B. Capsid modifications 

Provided herein are modifications of the viral capsid that ablate the 
interaction of an adenovirus with its natural receptors. In particular, fiber 

15 modifications that result in ablation of the interaction of an adenvirus with HSP 
are provided. These fiber modifications can be combined with other capsid 
protein modifications, such as other fiber modifications and/or penton and/or 
hexon modifications, to fully ablate viral interactions with natural receptors, 
when expressed on a viral particle. The modification should not disrupt trimer 

20 formation or transport of fiber into the nucleus. 
1 . Fiber genes and proteins 

The fiber protein extends from the capsid and mediates viral binding to 

the cell surface by binding to specific cell receptors (Philipson et al. (1968) J. 

Virol. 2:1064-1075). The fiber is c. trimeric protein that includes an N-terminal 
25 tail domain that interacts with the adenovirus penton base, a central shaft 

domain of varying length, and a C-terminal knob domain that contains the cell 

receptor binding site (Chroboczek et al. (1995) Curr. Top. Microbiol. Immunol. 

753:163-200; Riurok et al. (1990) J.Mol.BioL 275:589-596; Stevenson et al. 

(1995) J. Virol. 55:2850-2857). The sequences of the fiber gene from a variety 
30 of serotypes including adenovirus serotypes 2 (Ad2), Ad5, Ad3, Ad35, Ad12, 

Ad40, and Ad41 are known. There are at least 21 different fiber genes in 

Genbank. 
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As noted, the fiber protein can be divided into three domains (see, e.g., 
Green et al. (1983) EMBO J. 2:1357-1365). The conserved N-terminus contains 
the sequences responsible for association with the penton base as well as a 
nuclear localization signal. A rod-like shaft of variable length contains repeats of 
a 15 amino acid beta structure, with the number of repeats ranging from 6 in 
Ad3 to 22 in Ad5. A conserved stretch of amino acids which includes the 
sequence TLWT (SEQ ID No. 36) marks the boundary between the repeating 
units of beta structure in the shaft and the globular head domain. The C-terminal 
head domain ranges in size from 1 57 amino acid residues for the short fiber of 
Ad41 to 193 residues for Ad1 1 and Ad34. The fiber spike is a homotrimer and 
it is thought that the C-terminus is responsible for trimerization of the fiber 
homotrimer and there are 1 2 spikes per virion which are attached via association 
with the penton base complex. 

2. Modification of HSP interaction 

The adenovirus fiber protein is a major determinant of adenovirus tropism 
(Gall etal. (1996) J. ViroL 70:2116-2123; Stevenson et al. (1 995) J. Virol. 
69:2850-2857). Dogma in the field has been that adenoviral entry occurs via 
binding to CAR and integrins. This is underscored by published data (Einfeld et 
al. (2001 ) J. Virology 75'A 1 284-1 1 291 ). It is shown herein, however, these 
published entry pathways are not the predominant ones that act in vivo. 
Moreover, as shown herein, the dominant entry pathway for hepatocytes in vivo 
involves a mechanism mediated by the fiber shaft, such as Ad5 shaft, through 
heparin sulfate proteoglycans binding. 

It is shown herein that elimination of this binding eliminates entry vis MSP 
binding, such as in hepatocytes. Adenoviral fiber shaft modifications that ablate 
viral interaction with HSP are provided. Thus, as provided herein, efficient 
detargeting of adenovirus in vivo can be achieved with appropriately designed 
fiber proteins. Suitable modifications, such as described herein, can be made 
with respect to any adenovirus in which the wild-type interacts with HSP. 

As provided herein, the ability of an adenoviral vector to interact with 
HSP is modified. In particular, the ability to interact is reduced or eliminated. 
Modifications include insertions, deletions, individual amino acid mutations and 
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other mutations that alter the structure of the fiber shaft such that the HSP 
binding of the modified fiber protein is ablated when compared to the HSP 
binding of the wild-type fiber protein. 

In a first aspect of this embodiment, an adenoviral fiber protein is 
5 modified by mutating one or more of the amino acids that interact with HSP. 

For example, the HSP binding motif of the modified fiber protein is no longer able 
to interact with HSP on the cell surface, thus ablating the viral interaction with 
HSP. For example, the adenoviral fiber is from a subgroup C adenovirus. 
Binding to HSP can be eliminated or reduced by mutating the fiber shaft in order 

10 to modify the ability of the HSP binding motif, which is, for example, KKTK 

sequence (SEQ ID No. 1) located between amino acid residues 91 to 94 in the 
Ad 5 fiber, to interact with HSP. The fiber proteins are modified by chemical 
and biological techniques known to those skilled in the art, such as site directed 
mutagenesis of nucleic aicd encoding the fiber or other techniques as illustrated 

15 herein. 

In another aspect of this embodiment, the ability of a fiber to interact 
with HSP is modified by replacing the wild-type fiber shaft with a fiber shaft, or 
portion thereof, of an adenovirus that does not interact with HSP to produce 
chimeric fiber proteins. The portion is sufficient to reduce or eliminate 

20 interaction with HSP. Examples of adenoviruses having fiber shafts that do not 
interact with HSP include (a) adenoviruses of subgroup B, such as, but are not 
limited to, Ad3, Ad35, Ad7, Ad1 1, Ad16, Ad21, Ad34, which do not have 
interaction with HSP, (b) adenoviruses of subgroup F, such as, but are not 
limited to, Ad40 and Ad41 , specifically Jie short fiber, and (c) adenoviruses of 

25 subgroup D, such as but are not limited to, Ad46. In another embodiment, 
adenoviral fiber shaft modifications that ablate viral interaction with HSP in 
combination with adenoviral fiber knob modifications that ablate viral interactions 
with CAR are provided. Suitable adenoviral fiber modifications include the 
fiber knob modifications are known to those of skill in the art and are exemplified 

30 herein (see, also, US. Patent Application Serial No. 09/870,203, filed on 30 May 
2001, and published as U.S. Published application No. 20020137213, in 
International Patent Application No. PCT/EP01 /06286 filed on 1 June 2001). 
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Modifications of the fiber include mutations of at least one amino acid in the CD 
loop of a wild-type fiber protein of an adenovirus from subgroup C, D, or E, or 
the long wild-type fiber of an adenovirus from subgroup F, whereby the ability of 
a fiber protein to bind to CAR is reduced or substantially eliminated. The fiber 
5 proteins with ablated CAR interaction are modified by chemical and biological 
techniques known to those skilled in the art, as illustrated herein and as 
described in the above patent application. 

Alternatively, adenoviral fiber modifications are made by replacing the 
wild-type fiber knob with a fiber knob of an adenovirus that does not interact 

10 with CAR. The fiber protein also will be selected so that it does not interact 
with HSP. Examples of adenoviruses having fiber knobs that do not interact 
with CAR include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Adl 1, 
Ad16, Ad21, Ad34, (b) adenoviruses of subgroup F, e.g., Ad40 and Ad41, 
specifically the short fiber. 

15 in another embodiment, adenoviral fiber shaft modifications that ablate 

viral interaction with HSP in combination with penton modifications that ablate 
viral interactions with a v integrins are provided. Suitable adenoviral penton 
modifications include the penton modifications, which are well known to those^ 
of skill in the art (see, e.g., U.S. Patent No. 5,731,190; see, also Einfeld eta!. 

20 {2001 ) J. Virology 75\\ 1 284-1 1 291 ; and Bai et al. (1 993) J. Virology 
67:5198-5205). 

For example, penton interaction with a v integrins can be ablated (reduced 
or eliminated) by substitution of the RGD tripeptide motif, required for <7 V 
interaction, in penton with a different tripeptide that does not interact with an or v 

25 integrin. The penton proteins with ablated a v integrin interactions are modified 
by chemical and biological techniques known to those skilled in the art (see, 
e.g., described U.S. Patent No. 6,731,190 and as illustrated herein). Generally, 
the adenovirus is a subgroup B or C adenovirus. 

Also provided are adenoviral fiber shaft modifications that ablate viral 

30 interaction with HSP in combination with adenoviral fiber knob modifications that 
ablate viral interactions with CAR and with penton modifications that ablate viral 
interactions with a v integrins. These modifications are described above and 
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prepared using chemical and biological techniques known to those skilled in the 
art and as illustrated herein. Generally the adenovirus is a subgroup B or 
subgroup C adenovirus. 

Preparation of fibers modified to eliminate or reduce HSP interactions and 
fibers modified to alter interactions with other receptors and cell surface 
proteins, such as CAR and/or a v integrin, is also described in the Examples 
below. The nucleic acid and/or amino acid sequences of exemplary modified 
fibers, whose construction are described below) are set forth as SEQ ID Nos. 45- 
72 as follows: 

SEQ ID Nos. 45 and 46 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK01, where 5F refers to 
Adenovirus 5 fiber, KOI is an exemplary mutation of the CAR interaction site 
described herein; 

SEQ ID Nos. 47 and 48 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01RGD, which further 
includes an RGD ligand to demonstrate retargeting; 

SEQ ID Nos. 49 and 50 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK012, where 5F refers 
to Adenovirus 5 fiber, KOI 2 is another exemplary mutation of the CAR 
interaction site described herein; 

SEQ ID Nos. 51 and 52 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S* nuc, where 5F 
refers to Adenovirus 5 fiber, S* is an exemplary mutation of the shaft that alters 
binding to HSP; 

SEQ ID Nos. 53 and 54 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S*RGD nuc, which 
further includes an RGD ligand; 

SEQ ID Nos. 55 and 56 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01S*, which contain 
the K01 and S* mutations; 
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SEQ ID Nos. 57 and 58 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK01S*RGD, which 
further includes an RGD ligand; 

SEQ ID Nos. 59 and 60 set forth the encoding nucleotide sequence and 
5 amino acid sequence of a Ad35 fiber; 

SEQ ID Nos. 61 and 62 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35FRGD, which is 35F 
fiber with an RGD ligand; 

SEQ ID Nos. 63 and 64 set forth the encoding nucleotide sequence and 

10 amino acid sequence of a Ad41 short fiber; 

SEQ ID Nos. 65 and 66 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 41sFRGD, which is 41 F 
short fiber with an RGD ligand; 

SEQ ID Nos. 67 and 68 set forth the encoding nucleotide sequence and 
15 amino acid sequence of Ad5 penton; 

SEQ ID Nos. 69 and 70 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5TS35H, which is a 
chimeric fiber in which an Ad5 fiber tail and shaft regions (5TS; amino acids 1 to 
403) are connected to an Ad35 fiber head region (35H; amino acids 137 to 323) 
20 to form the 5TS35H chimera; and 

SEQ ID Nos. 71 and 72 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35TS5H, which is a 
chimeric fiber in which an Ad35 fiber tail and shaft regions (35TS; amino acids 1 
to 136) are connected to an Ad5 fiber head region <5H; amino acids 404 to 581) 
25 to form the 35TS5H chimera. 

SEQ ID No. 1 sets forth the nucleotide sequence of Ad fiber; SEQ ID Nos. 
2 and 3 also set forth the coding nucleic acid sequencs for fibers with modified 
fiber knobs for ablated CAR interaction (see, SEQ ID No. 2 for KOI and SEQ ID 
No. 3 for KOI 2); SEQ ID No. 4 also sets for the encoding nucleic acid sequence 
30 of a modified penton for ablated a v integrins (SEQ ID No. 4). 

The modified fibers are displayed on virus particles by modifying the fiber 
protein and optionally additional proteins. This can be achieved by preparing 
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adenoviral vectors that express the modified capsid proteins and produce 

particles with modified fibers, or by packaging adenoviral vectors, particularly 

those that do not encode one or more capsid proteins in appropriate packaging 

lines. Hence, as discussed in detail below, adenoviral vectors and viral particles 

5 with modified fibers that do no bind to HSP are provided. 

C. Nucleic acids. Adenoviral vectors and cells containing the nucleic acids 
and cells containing the vectors 

Also provided are polynucleotides that encode modified capsid proteins 
and that encode vectors for preparation of adenovirus that express modified 

10 capsid proteins provided herein. The sequences of the wild-type adenovirus 

proteins are well known in the art and are modified as described herein. Nucleic 
acid molecules, such as cDNA that encode an exemplary modified fiber knob for 
ablated CAR interaction (see, SEQ ID No. 2 for KOI and SEQ ID No. 3 for K012) 
and for a modified penton for ablated a v integrins (SEQ ID No. 4) are provided. 

15 As discussed above, modified capsid proteins with altered tropism for CAR and 
a v integrins are known and described in the patents, applications and literature 
cited herein and known to those of skill in the art (see, e.g., U.S. Patent No. 
5,731,190, U.S. application Serial No. 09/870,203, published as U.S. Published 
application No. 20020137213; and Bai et a/. (1993) J. Virology 

20 67:5198-5208). 

Also provided are vectors including the polynucleotides provided herein. 
Such vectors include partial or complete adenoviral genomes and plasmids. Such 
vectors are constructed by techniques known to those skilled in the art and as 
illustrated herein. Also provided are adenoviral vectors modified by replacing 

25 whole fiber protein, or portions thereof, with the fiber proteins, or appropriate 
portions thereof, of an adenovirus that does not interact with HSP. 
Adenoviruses that do not interact with HSP can be identified by using the 
methods described herein which detect binding or non-binding of fiber proteins 
and adenoviruses with HSP. Among the adenoviral vectors provided herein are 

30 those of subgroup C, which include Ad2 and Ad5, in which the nucleic acid 
encoding the fiber shaft or a portion including the HSP-binding portion is 
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replaced with nucleic acid encoding fiber or an appropriate portion thereof from a 
serotype, such as Ad35. 

Adenoviral fiber modifications, thus, can be made in viral particles by 
replacing the entire fiber protein with the fiber protein of an adenovirus that does 
5 not interact with CAR and/or replacing the HSP binding portion with a portion 
that does not bind. Generally the adenovirus is a subgroup B or subgroup C 
adenovirus, and also an adenovirus of subgroup D, such as Ad46. Adenoviral 
vectors of subgroup C, such as Ad2 and Ad5, having a replaced fiber knob are 
prepared using techniques well known in the art and as illustrated herein. 

10 1. Preparation of viral particles 

The packaging cells used to produce the viruses provided herein contain 
the nucleic acid encoding the capsid protein, including the mutated fiber protein 
provided herein. Such nucleic acid can be transfected into the cell, generally 
part of as part of plasmid, or it can be infected into the cell with a viral vector. It 

15 can be stably incorporated into the genome of the cell, thus providing for a 

stable cell line. Alternatively, nucleic acid encoding the mutated capsid protein 
can be removed from the genome, in which case a transient complementing cell 
is employed. 

The adenovirus genome to be packaged is transferred into the 
20 complementing cell by techniques known to those skilled in the art. These 

techniques include transfection or infection with the adenovirus. The nucleic acid 
encoding the mutated fiber protein can be in this genome instead of in the 
packaging cell. 

In certain cases, when the nucleic acid encoding the mutated fiber is in 
25 the genome to be packaged, it can be desirable for the packaging cell to also 
encode a fiber protein. Such protein can assist in the maturation and packaging 
of an infectious particle. Such protein can be a wild-type fiber protein or one 
modified such that it is unable to attach to the penton base protein and is for 
use, for example, in producer cells where the fiber is included to provide the 
30 packaging function and the vector encodes a full-length fiber. 

The packaging cells are cultured under conditions that permit the 
production of the desired viral particle. The viral particles are recovered by 
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standard techniques. An exemplary method for producing adenoviral particles 
provided herein is as follows. The nucleic acid encoding the mutated fiber 
protein is made using standard techniques in an adenoviral shuttle plasmid. This 
plasmid contains the right end of the virus, in particular from the end of the E3 
region through the right ITR. This plasmid is co-transfected into competent cells 
of an E. coli strain, such as the well known E coli strain BJ5183 (see, e.g., 
Degryse (1996) Gene 7 70:45-50) along with a plasmid, which contains the 
remaining portion of the adenovirus genome, except for the E1 region and 
sometimes also the E2a region and also contains a corresponding region of 
homology. Homologous recombination between the two plasmids generates a 
full-length plasmid encoding the entire adenoviral vector genome. 

This full-length adenoviral vector genome plasmid is then transfected into 
a complementing cell line. The transfection can be performed in the presence of 
a reagent that directs adenoviral particle entry into producer cells. Such 
reagents include, but are not limited to, polycations and bifunctional reagents, 
such as those described herein. A complementing cell is, for example, is a cell 
of the PER.C6 cell line, which contains the adenoviral E1 gene (PER.C6 is 
available, for example, from Crucell, The Netherlands; deposited under ECACC 
accession no. 96022940; see, also Fallaux et al. (1998) Hum. Gene Ther. 
3:1909-1907; see, also, U.S. Patent No. 5,994,128) or an AE1-2a cell (see, 
Gorziglia et al. (1 996) J. Virology 70:4173-4178; and and Von Seggern et al. 
(1998) .7. Gen. Virol. 79:1461-1468)). 

AE1-2a cells are derivatives of the A549 lung carcinoma line (ATCC # 
CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 41) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 42), which provide complementation of the adenoviral E1 and E2a 
functions, respectively. 

The 633 cell line (see, von Seggern et al. (2000) J. Virology 
74:354-362), which stably expresses the adenovirus serotype 5 wild-type fiber 
protein, and was derived from the AE1-2a cell line, is another an example of 
complementing cells. When the cell line is 633 cells, the final passage of 
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adenoviral vector is performed on another complementing cell line (e.g., Per.C6), 
which does not express wild-type Ad5 fiber. 

Jhe transfected complementing cells are maintained under standard cell 
culture conditions. The adenoviral plasmids recombine to form the adenoviral 
5 genome that is packaged. The particles are infectious, but replication deficient 
because their genome is missing at least the E1 genes. When performed in the 
633 cells the particles contain wild-type and mutated fiber proteins. They are 
recovered from the crude viral lysate, amplified, and are purified by standard 
techniques. 

10 The recovered particles can be used to infect PER.C6 or AE1-2a cells. 

This permits the recovery of particles whose capsids contain only the desired 
mutated fiber. This two-step procedure provides high titer batches of the 
adenoviral particles provided herein. The adenoviral particles can be replication 
competent or replication incompetent. 
15 In one embodiment, the particles selectively replicate in certain 

predetermined target tissue but are replication incompetent in other cells and 
tissues, in a particular embodiment, the adenoviral particles replicate in 
abnormally proliferating tissue, such as solid tumors and other neoplasms. In 
replication conditional adenoviruses, a gene essential for replication is placed 
20 under control of a heterologous promoter which is cell or tissue specific. For 

example, the E1a gene is placed under control of a promoter which is active in a 
tumor cell to produce an oncolytic adenovirus or oncolytic adenoviral vector. 
Administration of oncolytic adenoviral vectors to tumor cells kills the tumor cells. 
Such replication conditional adenoviral particles and vectors can be produced by 
25 techniques known to those skilled in the art, such as those disclosed in the 

above-referenced U.S. Patent Nos. 5,998,205 and 5,801,029. These particles 
and vectors can be produced in adenoviral packaging cells as disclosed above. 
Generally packaging cells are those that have been designed to limit homologous 
recombination that could lead to wild-type adenoviral particles. Such cells are 
30 well known and include the packaging cell known as PER.C6 (see, e.fir.,U.S. 

Patent Nos. 5,994,128 and 6,033,908; deposited under ECACC accession no. 
96022940). Since oncolytic vectors are replication competent in certain cell 
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types, they can be amplified in cell lines derived from said cell type without 
provision of Ad complementary genes. 

2. Adenoviral vectors and particles 

The adenovirus as used herein for production of the adenoviral vectors 
5 and particles can be of any serotype. Adenoviral stocks that can be employed 
as a source of adenovirus or adenoviral coat protein, such as fiber and/or penton 
base, can be amplified from the adenoviral serotypes 1 through 47, which are 
currently available from the American Type Culture Collection (ATCC, Rockville, 
Md.), or from any other serotype of adenovirus available from any other source. 

10 For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, 31), 
subgroup B (e.g., serotypes 3, 7, 1 1 , 1 4, 1 6, 21 , 34, 35), subgroup C (e.g., 
serotypes 1, 2, 5, 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 
22-30, 32, 33, 36-39, 42-47), subgroup E (serotype 4), subgroup F (serotype 
40, 41), or any other adenoviral serotype. 

15 In certain embodiments, the adenovirus is a subgroup B or a subgroup C 

adenovirus. Subgroup C adenoviruses which are modified in as described herein, 
include, but are not limited to, Ad2 and Ad5. For Ad5, the mutation is made in 
the KKTK sequence (SEQ ID No. 1) located between amino acid residues 91 to 
94. The fiber proteins can be modified by chemical and biological techniques 

20 known to those skilled in the art. These methods include, but are not limited to, 
site directed mutagenesis and techniques as illustrated herein. 

The adenoviral particle generally includes a targeting ligand as described 
above. The presence of the targeting ligand permits the delivery of a gene to a 
desired ceil type which is different from the cell type that yvild-type adenovirus 

25 particles infect or the same as that a wild-type particle infects, but allowing the 
infection in a selective manner, i.e., non-target cell types are not significantly 
infected. 

The adenoviral vectors provided herein can be used to study cell 
transduction and gene expression in vitro or in various animal models. The latter 
30 case includes ex vivo techniques, in which cells are transduced in vitro and then 
administered to the animal. They also can be used to conduct gene therapy on 
humans or other animals. Such gene therapy can be ex vivo or in vivo. For in 
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vivo gene therapy, the adenoviral particles in a pharmaceutically-acceptable 
carrier are delivered to a human in a therapeutically effective amount in order to 
prevent, treat, or ameliorate a disease or other medical condition in the human 
through the introduction of a heterologous gene that encodes a therapeutic 
5 protein into cells in such human. The adenoviruses are delivered at a dose 
ranging from approximately 1 particle per kilogram of body weight to 
approximately 1 0 14 particles per kilogram of body weight. Generally, they are 
delivered at a dose of approximately 10 6 particles per kilogram of body weight to 
approximately 10 13 particles per kilogram of body weight, and typically the dose 
0 ranges from approximately 10 8 particles per kilogram of body weight to 
approximately 10 12 particles per kilogram of body weight. 

Any vectors known to those of skill in the art can be employed and used 
to produce viral particles that include fibers modified to ablate (including reduce) 
binding to HSP. Some exemplary vectors are as follows. 
5 a. Gutless vectors 

Gutted adenovirus vectors are those from which most or all viral genes 
have been deleted. They are grown by co-infection of the producing cells with a 
"helper" virus (such as using an El-deleted Ad vector), where the packaging cells 
expresses the E1 gene products. The helper virus trans- complements the 
20 missing Ad functions, including production of the viral structural proteins needed 
for particle assembly. To incorporate the capsid modifications into a gutted 
adenoviral vector capsid, the changes must be made to the helper virus as 
described herein. All the necessary Ad proteins including the modified capsid 
protein are provided by the modified helper virus, and the gutted adenovirus 
25 particles are equipped with the particular modified capsid expressed by the host 
cells. The E1a, Eb, E2a, E2b and E4 are generally required for viral replication 
and packaging. If these genes are deleted, then the packaging cell must provide 
these genes or functional equivalents. 

A helper adenovirus vector genome and a gutless adenoviral vector 
30 genome are delivered to packaging cells. The cells are maintained under 

standard cell maintenance or growth conditions, whereby the helper vector 
genome and the packaging cell together provide the complementing proteins for 
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the packaging of the adenoviral vector particle. Such gutless adenoviral vector 
particles are recovered by standard techniques. The helper vector genome can 
be delivered in the form of a plasmid or similar construct by standard 
transfection techniques, or it can be delivered through infection by a viral particle 
containing the genome. Such viral particle is commonly called a helper virus. 
Similarly, the gutless adenoviral vector genome can be delivered to the cell by 
transfection or viral infection. 

The helper virus genome can be the modified adenovirus vector genome 
as disclosed herein. Such genome also can be prepared or designed so that it 
lacks the genes encoding the adenovirus E1 A and E1B proteins. In addition, the 
genome can further lack the adenovirus genes encoding the adenovirus E3 
proteins. Alternatively, the genes encoding such proteins can be present but 
mutated so that they do not encode functional E1A, E1B and E3 proteins. 
Furthermore, such vector genome can not encode other functional early proteins, 
such as E2A, E2B3, and E4 proteins. Alternatively, the genes encoding such 
other early proteins can be present but mutated so that they do not encode 
functional proteins. 

In producing the gutless vectors, the helper virus genome is also 
packaged, thereby producing helper virus. In order the minimize the amount of 
helper virus produced and maximize the amount of gutless vector particles 
produced, the packaging sequence in the helper virus genome can be deleted or 
otherwise modified so that packaging of the helper virus genome is prevented or 
limited. Since the gutless vector genome will have an unmodified packaging 
sequence, it will be preferentially packaged. 

One way to do this is to mutate the packaging sequence by deleting one 
or more of the nucleotides comprising the sequence or otherwise mutating the 
sequence to inactivate or hamper the packaging function; One exemplary 
approach is to engineer the helper genome so that recombinase target sites flank 
the packaging sequence and to provide a recombinase in the packaging cell. The 
action of recombinase on such sites results in the removal of the packaging 
sequence from the helper virus genome. The recombinase can be provided by a 
nucleotide sequence in the packaging cell that encodes the recombinase. Such 
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sequence can be stably integrated into the genome of the packaging cell. 
Various kinds of recombinase are known by those skilled in the art, and include, 
but are not limited to, Cre recombinase, which operates on so-called lox sites, 
which are engineered on either side of the packaging sequence as discussed 
5 above (see, e.g.. U.S. Patent Nos. 5,91 9, 676, 6,080,569 and 5,91 9,676; see, 
also, e.g., Morsy and Caskey, Molecular Medicine Today, Jan. 1999, pgs. 
18-24). 

An example of a gutless vector is pAdARSVDys (Haecker et at. (1996) 
Hum Gene Ther. 7:1907-1914)). This plasmid contains a full-length human 
10 dystrophin cDNA driven by the RSV promoter and flanked by Ad inverted 
terminal repeats and packaging signals. 293 cells are infected with a 
first-generation Ad, which serves as a helper virus, and then transfected with 
purified pAdARSVDys DNA. The helper Ad genome and the pAdARSVDys DNA 
are replicated as Ad chromosomes, and packaged into particles using the viral 
15 proteins produced by the helper virus. Particles are isolated and the 

pAdARSVDys-containihg particles separated from the helper by virtue of their 
smaller genome size and therefore different density on CsCI gradients. Other 
examples of gutless adenoviral vectors are known (see, e.g., Sandig eta!. 
(2000) Proc. Natl. Acad. Sci. U.S.A. 37^:1002-7). 
20 b. Oncolytic vectors 

Briefly, oncolytic adenoviruses, which are viruses that replicate selectively 
in tumor cells, are designed to amplify the input virus dose due to viral 
replication in the tumor, leading to spread of the virus throughout the tumor 
mass. In situ replication of adenoviruses leads to cell lysis. This in situ replication 
25 permits relatively low, non-toxic doses to be highly effective in the selective 

elimination of tumor cells. One approach to achieving selectivity is to introduce 
loss-of-function mutations in viral genes that are essential for growth in 
non-target cells but not in tumor cells. (See, e.g., U.S. Patent No. 5,801,029.) 
This strategy is exemplified by the use of Addl1520, which has a deletion in the 
30 E1b-55KD gene. In normal cells, the adenoviral E1b-55KD protein is needed to 
bind to p53 to prevent apoptosis. In p53-deficient tumor cells, E1b-55K binding 
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to p53 is unnecessary. Thus, deletion of E1b-55KD should restrict vector 
replication to p53-deficient tumor cells. 

Another approach is to use tumor-selective promoters to control the 
expression of early viral genes required for replication (see, e.g., International 
PCT application Nos. WO 96/17053 and WO 99/25860). Thus, in this approach 
the adenoviruses selectively replicate and lyse tumor cells if the gene that is 
essential for replication is under the control of a promoter or other transcriptional 
regulatory element that is tumor-selective. 

For example oncolytic adenoviral vectors that contain a cancer selective 
regulatory region operatively linked to an adenoviral gene essential for adenoviral 
replication are known (see, e.g., U.S. Patent No. 5,998,205). Adenoviral genes 
essential for replication include, but are not limited to, E1a, E1b, E2a, E2b and 
E4. For example, an exemplary oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the E1a gene. In other 
embodiments, the oncolytic adenoviral vector has a cancer selective regulatory 
region of the present invention operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
also can include at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 
systemic immune response against tumor cells. 

Other exemplary oncolytic adenoviral vectors include those in which 
expression of an adenoviral gene, which is essential for replication, is controlled 
by E2F-responsive promoters, which are selectively transactivated in cancer 
cells. Thus, vectors that contains an adenoviral nucleic acid backbone that 
contains in sequential order: A left ITR, an adenoviral packaging signal, a 
termination signal sequence, an E2F responsive promoter which is operably 
linked to a first gene, such as E1a, essential for replication of the recombinant 
viral vector and a right ITR (see, published International PCT application No. 
WO02/06786, and U.S. Patent No. 5,998,205). 

In other embodiments, the oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
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can also carry at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 
systemic immune response against tumor cells. 
3. Packaging 

5 The viral particles provided herein can be made by any method known to 

those of skill in the art. Generally they are prepared by growing the adenovirus 
vector that contains nucleic acid that encodes the modified fiber protein in a 
standard adenovirus packaging cells to produce particles that express the 
modified fibers. Alternatively, the vectors do not encode fibers. Such vectors 

0 are packaged in producer cells to produce particles that express the modified 
fiber proteins. 

As discussed, recombinant adenoviral vectors generally have at least a 
deletion in the first viral early gene region, referred to as E1 , which includes the 
E1a and E1b regions. Deletion of the viral El region renders the recombinant 

5 adenovirus defective for replication and incapable of producing infectious viral 
particles in subsequently-infected target cells. Thus, to generate E1 -deleted 
adenovirus genome replication and to produce virus particles requires a system 
of complementation which provides the missing E1 gene product. E1 
complementation is typically provided by a cell line expressing E1 , such as the 
20 human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham etal., J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 
that may be usable for production of defective adenovirus having a portion of 

25 the adenovirus E4 region have been reported (WO 96/22378). Multiply deficient 
adenoviral vectors and complementing cell lines have also been described (WO 
95/34671, U.S. Patent No. 5,994,106). 

For example, copending U.S. application Serial No. 09/482,682 (also 
filed as International PCT application No. PCT/EPOO/00265, filed January 14, 

30 200, published as International PCT application No. WO/0042208) provides 

packaging cell lines that support viral vectors with deletions of major portions of 
the viral genome, without the need for helper viruses and also provides cell lines 
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and helper viruses for use with helper-dependent vectors. The packaging cell 
line has heterologous DNA stably integrated into the chromosomes of the cellular 
genome. The heterologous DNA sequence encodes one or more adenovirus 
regulatory and/or structural polypeptides that complement the genes deleted or 
mutated in the adenovirus vector genome to be replicated and packaged. 
Packaging cell lines express, for example, one or more adenovirus structural 
proteins, polypeptides, or fragments thereof, such as penton base, hexon, fiber, 
polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, polypeptide VIII, 
and biologically active fragments thereof. The expression can be constitutive or 
under the control of a regulatable promoter. These cell lines are particularly 
designed for expression of recombinant adenoviruses intended for delivery of 
therapeutic products. For use herein, such packaging cell lines can express the 
modified capsid proteins, such as the fiber proteins who binding to HSP is 
reduced or eliminated, and/or the modified penton and hexon proteins. 

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides E1 A and E1B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing each DNA molecule 
into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
introduced via. a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 
such as Ad 37, polypeptide or fragment thereof. 1 

For applications, such as therapeutic applications, the delivery plasmid 
further can include a nucleotide sequence encoding a heterologous polypeptide. 
Exemplary delivery plasmids include, but are not limited to, pDV44, pAE!B/?-gal 
and pAEisplB. In a similar or analogous manner, therapeutic nucleic acids, 
such as nucleic acids that encode therapeutic genes, can be introduced. 
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The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
chromosome(s) of the cellular genome of the cell. 

Typically, the packaging cell lines will contain nucleic acid encoding the 
5 fiber protein or modified protein stably integrated into a chromosome or 

chromosomes in the cellular genome. The packaging cell line can be derived from 
a procaryotic cell line or from a eukaryotic cell line. While various embodiments 
suggest the use of mammalian cells, and more particularly, epithelial cell lines, a 
variety of other, non-epithelial cell lines are used in various embodiments. Thus, 
10 while various embodiments disclose the use of a cell line selected from among 
the 293, A549, W162, HeLa, Vero, 21 1 , and 21 1A cell lines, and any other cell 
lines suitable for such use are likewise contemplated herein. 
D. Addition of a targeting ligand 

The viral particles that are detargeted as described herein, can be 
15 retargeted to selected cells and/or tissues by inclusion of an appropriate 

targeting ligand in the capsid. The ligand cam be included in any of the capsid 
proteins, such as fiber, hexon and penton. Loci for inclusion of nucleic acid 
encoding a is known to those of skill in the art for a a variety of adenovirus 
serotypes; if necessary appropriate loci and other parameters can be empirically 

20 determined. 

The ligand can be produced as a fusion by inclusion of the coding 
sequences in the nucleic acid encoding a capsid protein, or chemically 
conjugated, such as via ionic, covalent or other interactions, to the capsid or 
bound to the capsid (e.g.. by Ab-ligand fusion, where Ab binds capsid protein; or 
25 by disulfide bonding or other crosslinking moieties or chemistries). 

Thus, for example, a modified fiber nucleic acid also can include 
sequences of nucleotides that encode a targeting ligand to produce viral particles 
that include a targeting ligand in the capsid. Targeting ligand and methods for 
including such Hgands in viral capids are well known. For example, inclusion of 
30 targeting ligands in fiber proteins is described in U.S. Patent Nos. 5,543,328 and 
5,756,086 and in U.S. Patent Application Serial No. 09/870,203, published as 
U.S. Published application No. 20020137213, and International Patent 
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Application No. PCT/EP01 /06286. For different serotypes and strains of 
adenoviruses, loci for insertion of targeting ligands can be empirically 
determined. For different serotypes and strains, such loci can vary. 

Because the adenovirus fiber has a trimeric structure, the ligand can be 
5 selected or designed to have a trimeric structure so that up to three molecules of 
the ligand are present for each mature fiber. Such ligands can be incorporated 
into the fiber protein using methods known in the art (see, e.g., U.S. Patent No. 
5,756,086). Instead of the fiber, the targeting ligand can be included in the 
penton or hexon proteins. Inclusion of targeting ligands in penton (see for 

10 example, in U.S. Patent Nos. 5,731,190 and 5,965,4-31) and in hexon (see for 
example, in U.S. Patent No. 5,965,541) is known. 

In one exemplary embodiment, the ligand is included in a fiber protein, 
which is a fiber protein mutated as described herein. As shown herein, the 
targeting ligand can be included, for example, within the HI loop of the fiber 

15 protein. Any ligand that can fit in the HI loop and still provide a functional virus 
is contemplated herein. Such ligands can be as long as or longer than 80-100 
amino acids (see, e.g., Belousova eta/. (2002) J. Virol. 75:8621-8631). Such 
ligands are added by techniques known in the art (see, e.g., published 
International Patent Application publication No. W099/39734 and U.S. Patent 

20 Application number 09/482,682). Other ligands can be be discovered through 
techniques known to those skilled in the art. Some non-limiting examples of 
these techniques include phage display libraries or by screening other types of 
libraries. 

Targeting ligands include any chemical moiety that preferentially directs 
25 an adenoviral particle to a desired cell type and/or tissue. The categories of such 
ligands include, but are not limited to, peptides, polypeptides, single chain 
antibodies, and multimeric proteins. Specific ligands include the TNF superfamily 
of ligands which include tumor necrosis factors (or TNF's) such as, for example, 
TNFa and TNF)?, lymphotoxins (LT), such as LT-or and LT-£, Fas ligand which 
30 binds to Fas antigen; CD40 ligand, which binds to the CD40 receptor of 

B-lymphocytes; CD30 ligand, which binds to the CD30 receptor of neoplastic 
cells of Hodgkin's lymphoma; CD27 ligand, NGF ligand, and OX-40 ligand; 
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transferrin, which binds to the transferrin receptor located on tumor cells, 
activated T -cells, and neural tissue cells; ApoB, which binds to the LDL receptor 
of liver cells; alpha-2-macroglobulin, which binds to the LRP receptor of liver 
cells; alpha-l acid glycoprotein, which binds to the asialoglycoprotein receptor of 
5 liver; mannose-containing peptides, which bind to the mannose receptor of 
macrophages; sialyl-Lewis-X antigen-containing peptides, which bind to the 
ELAM-I receptor of activated endothelial cells; CD34 ligand, which binds to the 
CD34 receptor of hematopoietic progenitor cells; ICAM-I, which binds to the 
LFA-I (CD1 1b/CD1 8) receptor of lymphocytes, or to the Mac-I (CD1 1a/CD1 8) 
10 receptor of macrophages; M-CSF, which binds to the c-fms receptor of spleen 
and bone marrow macrophages; circumsporozoite protein, which binds to 
hepatic Plasmodium falciparum receptor of liver cells; VLA-4, which binds to the 
VCAM-I receptor of activated endothelial cells; HIV gp120 and Class II MHC 
antigen, which bind to the CD4 receptor of T -helper cells; the LDL receptor 
15 binding region of the apolipoprotein E (ApoE) molecule; colony stimulating factor, 
or CSF, which binds to the CSF receptor; insulin-like growth factors, such as 
IGF-I and IGF-II, which bind to the IGF-I and IGF-II receptors, respectively; 
Interleukins 1 through 14, which bind to the Interleukin 1 through 14 receptors, 
respectively; the Fv antigen-binding domain of an immunoglobulin; gelatinase 
20 (MMP) inhibitor; bombesin, gastrin-releasing peptide; substance P; somatostatin; 
luteinizing hormone releasing hormone (LHRH); vasoactive peptide (VIP); gastrin; 
melanocyte stimulating hormone (MSH); cyclic RGD peptide and any other ligand 
or cell surface protein-binding (or targeting) molecule. 
E. Heterologous polynucleotides and Therapeutic Nucleic Acids 
25 The packaged adenoviral genome also can contain a heterologous 

polynucleotide that encodes a product of interest, such as a therapeutic protein. 
Adenoviral genomes containing heterologous polynucleotides are well known 
(see, e.g., U.S. Patent Nos. 5,998,205, 6,156,497, 5,935,935, and 
5,801 ,029). These can be used for in vitro and in vivo delivery of the products 
30 of heterlogous polynucleoties or the heterologous polynucleotides. 

Thus, the adenoviral particles provided herein can be used to engineer a 
cell to express a protein that it otherwise does not express or does not express 
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in sufficient quantities. This genetic engineering is accomplished by infecting the 
desired cell with an adenoviral particle whose genome includes a desired 
heterologous polynucleotide. The heterologous polynucleotide is then expressed 
in the genetically engineered cells. For use herein the cell is generally a 
5 mammalian cell, and is typically a primate cell, including a human cell. The cell 
can be inside the body of the animal (in vivo) or outside the body (in vitro). 
Heterologous polynucleotides (also referred to as heterologous nucleic acid 
sequences) are included iri the adenoviral genome within the particle and are 
added to that genome by techniques known in the art. Any heterologous 
10 polynucleotide of interest can be added, such as those disclosed in U.S. Patent 
No. 5,998,205, incorporated herein by reference. Polynucleotides that are 

introduced into an Ad genome of vector can be any that encode a protein of 
interest or that are regulatory sequences. Proteins include, but are not limited 
to, therapeutic proteins, such as an immunostimulating protein, such as an 
15 interleukin, interferon, or colony stimulating factor, such as granulocyte 
macrophage colony stimulating factor (GM-CSF; see, e.g., 5,908,763F. 
Generally, such GM-CSF is a primate GM-CSF, including human GM-CSF. Other 
immunostimulatory genes include, but are not limited to, genes that encode 
cytokines IL1, IL2, IL4, IL5, IFN, IFN, TNF, IL1 2, IL18, and flt3), proteins that 
20 stimulate interactions with immune cells (B7, CD28, MHC class I, MHC class II, 
TAPs), tumor-associated antigens (immunogenic sequences from MART-1, 
gp100(pmel-17), tyrosinase, tyrosinase-related protein 1 , tyrosinase-related 
protein 2, melanocyte-stimulating hormone receptor, MAGE1, MAGE2, MAGE3, 
MAGE12, BAGE, GAGE, NY-ESO-1, -catenin, MUM-1, CDK-4, caspase 8, KIA 
25 0205, HLA-A2R1701, -fetoprotein, telomerase catalytic protein, G-250, MUC-1, 
carcinoembryonic protein, p53, Her2/neu, triosephosphate isomerase, CDC-27, 
LDLR-FUT, telomerase reverse transcriptase, and PSMA), cDNAs of antibodies 
that block inhibitory signals (CTLA4 blockade), chemokines (MIP1, MIP3, CCR7 
ligand, and calreticulin), and other proteins. 
30 Other polynucleotides, including therapeutic nucleic acids, such as 

therapeutic genes, of interest include, but are not limited to, anti-angiogenic, and 
suicide genes. Anti-angiogenic genes include, but are not limited to, genes that 
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encode METH-1 , METH -2, TrpRS fragments, proliferin-related protein, prolactin 
fragment, PEDF, vasostatin, various fragments of extracellular matrix proteins 
and growth factor/cytokine inhibitors. Various fragments of extracellular matrix 
proteins include, but are not limited to, angiostatin, endostatin, kininostatin, 
5 fibrinogen-E fragment, thrombospondin, tumstatin, canstatin, and restin. Growth 
factor/cytokine inhibitors include, but are not limited to, VEGF/VEGFR 
antagonist, sFlt-1 , sFlk, sNRP1 , angiopoietin/tie antagonist, sTie-2, chemokines 
(IP-10, PF-4, Gro-beta, IFN-gamma (Mig), IFN, FGF/FGFR antagonist (sFGFR), 
Ephrin/Eph antagonist (sEphB4 and sephrinB2), PDGF, TGF and IGF-1. 
10 A "suicide gene" encodes a protein that can lead to cell death, as with 

expression of diphtheria toxin A, or the expression of the protein can render cells 
selectively sensitive to certain drugs, e.g., expression of the Herpes simplex 
thymidine kinase gene (HSV-TK) renders cells sensitive to antiviral compounds, 
such as acyclovir, gancyclovir and FIAU (1-(2-deoxy-2-fluoro--.beta.- 
15 D-arabinofuranosil)-5-iodouracil). Other suicide genes include, but are not limited 
to, genes that encode carboxypeptidase G2 (CPG2). carboxylesterase (CA), 
cytosine deaminase (CD), cytochrome P450 (cyt-450), deoxycytidine kinase 
(dCK), nitroreductase (NR), purine nucleoside phosphorylase (PNP), thymidine 
phosphorylase (TP), varicella zoster virus thymidine kinase (VZV-TK), and 
20 xanthine-guanine phosphoribosyl transferase (XGPRT). Alternatively, a 

therapeutic nucleic acid can exert its effect at the level of RNA, for instance, by 
encoding an antisense message of ribozyme, a protein that affects splicing or 3' 
processing (e.g., polyadenylation), or a protein that affects the level of 
expression of another gene within the cell, e.g. by mediating an altered rate of 
25 mRNA accumulation, an alteration of mRNA transport, and/or a change in 

post-transcriptional regulation. The addition of a therapeutic nucleic acid to a 
virus results in a virus with an additional antitumor mechanism of action. Thus, 
a single entity (i.e., the virus carrying a therapeutic transgene) is capable of 
inducing multiple antitumor mechanisms. Other encoded proteins, include, but 
30 are not limited to, herpes simplex virus thymidine kinase (HSV-TK), which is 
useful as a safety switch (see, U.S. Patent Application No. 08/974,391 , filed 
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November 19, 1997, which published as PCT Publication No. WO/9925860), 

Nos, FasL, and sFasR (soluble Fas receptor). 

Also contemplated are combinations of two or more transgenes with 

synergistic, complementary and/or nonoverlapping toxicities and methods of 
5 action. The resulting adenovirus can retain the viral oncolytic functions and, for 

example, additionally are endowed with the ability to induce immune and 

anti-angiogenic responses and other responses as desired. 

Therapeutic polynucleotides and heterologous polynucleotides also 

include those that exert an effect at the level of RNA or protein. These include 
10 include a factor capable of initiating apoptosis, RNA, such as RNAi and other 

double-stranded RNA, antisense and ribozymes, which among other capabilities 

can be directed to mRNAs encoding proteins essential for proliferation, such as 

structural proteins, transcription factors, polymerases, genes encoding cytotoxic 

proteins, genes that encode an engineered cytoplasmic variant of a nuclease 
15 (e.g. RNase A) or protease (e.g. trypsin, papain, proteinase K and carboxy- 

peptidase). Other polynucleotides include a cell or tissue specific promoters, 

such as those used in oncolytic adenoviruses (see, e.g., U.S. Patent No. 

5,998,205). 

The heterologous polynucleotide encoding a polypeptide also can contain 
20 a promoter operably linked to the coding region. Generally the promoter is a 
regulated promoter and transcription factor expression system, such as the 
published tetracycline-regulated systems, or other regulatable systems (WO 
01/30843), to allow regulated expression of the encoded polypeptide. 
Exemplary of other promoters, are tissue-selective promoters, such as those 
25 described in U.S. Patent No. 5,998,205. An exemplary regulatable promoter 
system is the Tet-On( and Tet-Off( systems currently available from Clontech 
(Palo Alto, CA). This promoter system allows the regulated expression of the 
transgene controlled by tetracycline or tetracycline derivatives, such as 
doxycycline. This system can be used to control the expression of the encoded 
30 polypeptide in the viral particles and nucleic acids provided herein. Other 
regulatable promoter systems are known (see, e.g., published U.S. No. 
20020168714, entitled "Regulation of Gene Expression Using Single-Chain, 
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Monomeric, Ligand Dependent Polypeptide Switches," which describes gene 
switches that contain ligand binding domains and transcriptional regulating 
domains, such as those from hormone receptors). Other suitable promoters that 
can be employed include, but are not limited to, adenoviral promoters, such as 
5 the adenoviral major late promoter and/or the E3 promoter; or heterologous 
promoters, such as the cytomegalovirus (CMV) promoter; the Rous Sarcoma 
Virus (RSV) promoter; inducible promoters, such as the MMT promoter, the 
metallothlonein promoter; heat shock promoters; the albumin promoter; and the 
ApoAl promoter. 

10 Therapeutic transgenes can be included in the viral constructs and 

resulting particles. Among these are those that result in an "armed" virus. For 
example, rather than delete E3 region as in some embodiments described herein, 
all or a part of the E3 region can be preserved or re-inserted in an oncolytic 
adenoviral vector (discussed above). The presence of all or a part of the E3 

15 region can decrease the immunogenicity of the adenoviral vector. It also 

increases cytopathic effect in tumor cells and decreases toxicity to normal cells. 
Typically such vector expresses more than half of the E3 proteins. 

Adenoviruses for therapy, including those for human therapy, are known. 
Such known viruses can be modified as provided herein to reduce or eliminate 

20 interaction with HSPs and optionally additional receptors. The adenoviral vectors 
that are used to produce the viral particles can include other modifications. 
Modifications include modifications to the adenovirus genome that is packaged 
in the particle in order to make an adenoviral vector. As discussed above, 
adenovirus vectors and particles with a variety of modifications are available. 

25 Modifications to adenvoiral vectors include deletions known in the art, such as 
deletions in one or more of the El, E2a, E2b, E3, or E4 coding regions. These 
adenoviruses are sometimes referred to as early generation adenoviruses, include 
those with deletions of all of the coding regions of the adenoviral genome 
("gutless" adenoviruses, discussed above) and also include replication-condi- 
30 tional adenoviruses, which are viruses that replicate in certain types of cells or 

tissues but not in other types as a result of placing adenoviral genes essential for 
replication under control of a heterologous promoter (discussed above; see, also 
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U.S. Patent No. 5,998,205, U.S. Patent No. 5,801,029; U.S. patent application 
60/348,670 and corresponding published International PCT application No. 
WO02/06786). These include the cytolytic, cytopathic viruses {or vectors), 
including the oncolytic viruses discussed above. 

Alternatively, as discussed above, the vector can include a mutation or 
deletion in the E1b gene. Typically such mutation or deletion in the E1b gene is 
such that the E1b-19kD protein becomes non-functional. This modification of 
the E1b region can be combined with vectors where all or a part of the E3 region 
is present. 

The oncolytic adenoviral vector can further include at least one 
heterologous coding sequence, such as one that encodes a therapeutic product. 
The heterologous coding sequence, such as therapeutic gene, is generally, 
although not necessarily, in the form of cDNA, and can be inserted at any locus 
that does not adversely affect the infectivity or replication of the vector. For 
example, it can be inserted in an E3 region in place of at least one of the 
polynucleotide sequences that encode an E3 protein, such as, for example, the 
19kD or 14.7 kD E3 gene. 
F. Propagation and Scale-up 

Since doubly ablated adenoviral vectors containing mutations in the fiber 
and/or penton capsid proteins result in inefficient cell binding and entry via the 
CAR/ov integrin entry pathway, scaled up technologies improve the growth and 
propagation of such vectors to produce high titers of the adenoviral vectors for 
clinical use. Thus, also provided is a method for scaling up the production of 
detargeted adenoviral vectors. The detargeted adenoviral vectors comprise an 
adenoviral vector modified to ablate the interaction of said vector with at least 
one host cell receptor compared with a wild-type adenoviral vector. The 
detargeted adenoviral vectors can comprise an adenoviral vector modified to 
ablate the interaction of said vector with one, two, three or more host cell 
receptors. Thus, the method is suitable for producing the detargeted adenoviral 
vectors disclosed herein. 

As noted, growth and propagation of doubly and fully ablated adenoviral 
vectors is enhanced by new scale up technologies. Doubly ablated vectors 
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contain mutations in the fiber and penton capsid proteins that result in inefficient 
cell binding and entry via the normal cellular entry pathway using CAR and 
integrins. These vectors are fully detargeted in vitro and, thus, alternative 
cellular entry strategies allow for the efficient growth and generation of high titer 
preparations. 

Two strategies have been envisioned to scale up vectors that are 
detargeted via fiber and/or penton modifications. These include: (a) the use of 
pseudoreceptor cell lines engineered to express a surface receptor that binds a 
ligand displayed on the vector (see, e.g., International PCT application No. WO 
98/54346) and (b) complementing cell lines that are engineered to express 
native fiber and that can be engineered to express native fiber and penton (see, 
e.g.. International PCT application No. WO 00/42208). Although these systems 
have shown promise for scaling up ablated adenoviral vectors, there is a need to 
develop a system for the simple, efficient production of the fully detargeted 
adenoviral vector for therapeutic uses. 

Provided herein is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or Afunctional reagents, 
which when added to tissue culture medium, bind adenoviral particles and direct 
their entry into the producer cells. 

Reagents (also called medium additives) also can be included in the tissue 
culture medium containing producer cells to be infected with the detargeted 
adenoviral vectors. Alternatively the reagents can be pre-mixed with the virus, 
which mixture is then added to the tissue producer cells. The reagents can be 
added to tissue culture medium containing producer cells, or producer cells can 
be added to tissue culture medium containing the reagents. Any suitable 
producer cell known to the skilled artisan can be used in the present methods. 
The reagents can be added at the same time that the producer cells are infected 
with detargeted adenoviral vectors. Generally the reagents are present in the 
tissue culture medium prior to infection by the detargeted adenoviral vectors. 
The medium additives are maintained in the tissue culture medium during vector 
growth, spread and propagation. High titer yields of adenoviral vectors are 
obtained by this method. 
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Reagents which are useful in this method are those that are capable of 
directing adenoviral particle entry into the producer cells. Such reagents include, 
but are not limited to, polycations and bifunctional reagents. Suitable 
polycations include, but are not limited to, polytheylenimine; protamine sulfate; 
5 poly-L-lysine hydrobromide; poly{dimethyl diallyl ammonium) chloride 

(Merquat(r)-IOO, Merquat(r)280, Merquat(r)550); poIy-L-arginine hydrochloride; 
poly-L-histidine; poly(4-vinylpyridine), poly(4-vinylpyridine) hydrochloride; 
polyf4-vinylpyridine)cross-linked, methylchloride quaternary salt; poly(4-vinyl- 
pyridine-co-styrene); poly(4-vinylpyridinium poly(hydrogen fluoride)); poly(4-vinyl- 

10 pyridinium-P-toluene sulfonate); poly(4-vinylpyridinium-tribromide); po!y(4-vinyl- 
pyrrolidone-co-2-dimethylamino-ethyl methacrylate); polyvinylpyrrolidone, 
cross-linked; poly viny!pyrroiidone> poly(melamine-co-formaldehyde); partially 
methylated; hexadimethrine bromide; poly(G!u, Lys) 1 :4 hydrobromide; poly(Lys, 
Ala) 3:1 hydrobromide; poly(Lys, Ala) 2:1 hydrobromide; poly-L-lysine 

15 succinylated; poly(Lys, Ala) 1:1 hydrobromide; and poly(Lys, Trp) 1:4 
hydrobromide. 

Suitable bifunctional reagents include, but are not limited to, antibodies or 
peptides that bind to the adenoviral capsid and that also contain a ligand that 
allows interaction with specific cell surface receptors of the producer cells. 
20 Examples of bifunctional reagents include: (a) anti-fiber antibody ligand fusions, 
(b) anti-fiber-Fab-FGF conjugate, (c) anti-penton-antibody ligand fusions, (d) 
anti-hexon antibody ligand fusions and (e) polylysine-peptide fusions. The ligand 
is any ligand that will bind to any cell surface receptor found on the producer 
cells. 

25 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
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EXAMPLE 1 

Construction of Ad5 Vectors Containing the Fiber AB Loop. KOI and Penton, 
PD1 Mutations and Derivatives Thereof 

Three recombinant adenoviral vectors were prepared that contain the KOI 
fiber or PD1 penton base mutations either alone or in combination, these vectors 
are designated Av3nBgFK01 Av1nBgPD1, and Av1 nBgFKC-1 PD1 . Construction 
of these vectors is described below and a general description of each vector is 
set forth in Table 1 . 

TABLE 1 
Description Of Detargeted 
Recombinant Adenoviral Vectors Used For Scale-up 

Vector 



Vector j 


Description 


Av3nBg 


An E1 , E2a, E3-de!eted adenoviral vector encoding a nuclear 
localizing £-galactosidase 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing £-galactosidase 


Av3nBgFK01 


The same as Av3nBg but containing the KOI mutation in the 
fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the PD1 mutation in the 
penton gene 


AvInBgFKOI PD1 


The same as AvInBg but containing the fiber KOI and penton 
PD1 mutations 



15 



20 AvInBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 43. 
Av3nBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 44. 
Av3nBgFK01 

25 Genetic incorporation of the KOI fiber mutation to generate Av3nBgFK01 

The adenoviral vector Av3nBgFK01 was generated in an E1-, E2a-, 
E3-deleted backbone based on the adenovirus serotype 5 genome. It contains a 
RSV promoted nuclear-localizing 0-galactosidase gene in place of the E1 region. 
In addition, the fiber gene carries the KOI mutation. This mutation results in a 



WO 03/062400 PCT/US03/02295 



-55- 

substitution of fiber amino acids 408 and 409, changing them from serine and 
proline to glutamic acid and alanine, respectively. 

The vector was constructed as follows. First, the plasmid pSKO! (Figure 
1) was digested with the restriction enzymes Sphl and Muni. The resulting DNA 
5 fragments were separated by electrophoresis on an agarose gel. The 1601 bp 
fragment containing all but the 5' end of the fiber gene was excised from the 
agarose gel and the DNA was isolated and purified. The fragment was then 
ligated with the 9236 bp fragment of p5FloxHRFRGD, which had been digested 
with Sphl and Muni. The resulting plasmid, pBFIoxHRFKOI , was digested with 

10, Spel and Pad and the 6867 bp fragment containing the fiber gene was isolated. 
The fragment was ligated with the 24,630 bp Spel-Pacl fragment of pNDSQ3.1 . 
The resulting plasmid, pNDSQ3.1K01 (Figure 2), was used together with 
pAdmireRSVnBg (Figure 3A) to generate a plasmid which encodes the full-length 
adenoviral vector genome. It, however, was necessary to remove the Pad site 

15 from pNDSQ3.1K01 (Figure 2) prior to recombination with pAdmireRSVnBg 

(Figure 3A) so that the final plasmid contains a unique Pad site adjacent to the 
5' ITR. The Pad site in pNDSQ3.1K01 was removed by digestion with Pad 
followed by blunting with T4 DNA Polymerase and religation. The resulting 
plasmid was called pNDSQ3.1 KOI (Pac. 

20 To generate a full-length plasmid containing the entire adenoviral genome, 

pAdmireRSVnBg (Figure 3A) was digested with Sail and co-transfected into 
competent cells of the E. coli strain BJ5183 along with pNDSQ3.1 K01 APac, 
which had been digested with BstBl. Homologous recombination between the 
two plasmids generated a full-length plasmid encoding the entire adenoviral 

25 vector genome, which was called pFLAv3nBgFK01 . 

The plasmid pFLAv3nBgK01 was linearized with Pad and transfected into 
633 cells. In the fiber complementing 633 cell line, the resulting viral DNA 
containing the KOI mutation is capable of being packaged into infectious viral 
particles containing a mixture of wildtype fiber and mutant fiber proteins. After 

30 five rounds of amplification in 633 cells, a cytopathic effect was observed. 
Three more rounds of amplification in 633 cells were performed followed by 
purification of the virus by standard CsCI centrifugation procedures. This viral 
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preparation was used to infect AE1-2a cells, which do not express fiber. The 
resulting virus contained only the mutant fiber protein on its capsid. Virus 
particles were purified by standard CsCI centrifugation procedures. 
AvInBgFKOI 

5 The v ector AvInBgFKOI is made in a similar manner to Av3nBgFK01 

described above. 
Av1nBgK012 

An additional fiber AB loop mutation (described by Einfeld et al. (2001) J. 
Virology 75'A 1 284-1 1 291 ) was incorporated into the genome of AvInBg. This 
10 AB loop mutation is a four amino acid substitution, R512S, A515G, E516G, and 
K517G, and is referred to as K012. The K012 mutation was incorporated into 
the fiber gene by PCR gene overlap extension using the plasmid pSQ1 (Figure 
3B) as template. The pSQ1 plasmid contains most of the Ad5 genome, 
extending from base pair 3329 through the right ITR, in a pBR322 backbone. 
15 First, a segment of the Ad5 genome extending from within the E3 region into the 
fiber gene was amplified by PCR using the plasmid pSQ1 as a template with the 
following primers termed 5FF, 5'-GAA CAG GAG GTG AGC TTA GA-3' SEQ ID 
No. 4), and 5FR, 5'-TCC GCC TCC ATT TAG TGA ACA GTT AGG AGA TGG 
AGC TGG TGT G-3' (SEQ ID No. 6). The primer 5FR contains an 18 base 
20 5'-extension that encodes the modified fiber AB loop amino acids from 51 2 
through 517. A second PCR using pSQ1 as a template amplified the region 
immediately 3' of the AB loop substitution and extending past the Muni site 
located 40 base pairs 3' of the fiber gene stop codon. The two primers used for 
this reaction were 3FF: 5'-TCA CTA A AT GGA GGC GGA GAT GCT AAA CTC. 
25 ACT TTG GTC TTA AC-3' (SEQ ID No. 7), and 3FR: 5'-GTG GCA GGT TGA 
ATA CTA GG-3' (SEQ ID No. 8). The primer 3FR contains an 18 base 
5'-extension that encodes the modified fiber AB loop amino acids 512 through 
517. Amplified products of the expected size were obtained and used in a 
second PCR with the end primers 5FF and 3FR to join the fragments together. 
30 The K012 PCR fragment was digested with Xbal and Muni cloned directly into 
the fiber shuttle plasmid, pFBshuttle(EcoRl) to generate the plasmid pFBSEK012 
which contains the 8.8kB EcoRI fragment of pSQ1 . The pFBSEKQ12 plasmid 
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was digested with Xbal and EcoRI and cloned into pSQ1 using a three-way 
ligation to generate pSQ1K012 (Figure 3C). The K012 cDNA was incorporated 
into the genome of AvInBg, an adenovirus vector with E1 and E3 deleted 
encoding £-galactosidase, by homologous recombination between Clal-linearized 
5 pSQ1K012 and pAdmireRSVnBg digested with Sail and Pad to generate 
Av1nBgK012. The K012 vector was transfected in 633 cells, scaled-up on 
non-fiber expressing cells and purified, as described above for KOI. 
Av1nBgPD1 

Genetic incorporation of the PD1 penton mutation to generate 
10 Av1nBgPD1 

The adenoviral vector Av1nBgPD1 is an E1-, E3-deleted vector based on 

the adenovirus serotype 5 genome. It contains a RSV promoted 

nuclear-localizing jff-galactosidase gene in the E1 region and also contains the 

PD1 mutation in the penton gene. The PD1 mutation results in a substitution of 

15 amino acids 337 through 344 of the penton protein, HAIRGDTF (SEQ ID No. 9), 
with amino acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thus replacing the RGD 
tripeptide (see, Einfeld et aL (2001 ) J. Virology 75:11284-11291). The 
mutation in the penton gene was generated in the plasmid pGEMpenS, which 
contains the Adenovirus serotype 5 penton gene. To generate the mutation, 

20 four oligonucleotides were synthesized. The sequences of the oligonucleotides 
were as follows: penton 1 : 5' CGC GGA AGA GAA CTC CAA C.GC GGC AGC 
CGC GGC AAT GCA GCC GGT GGA GGA CAT GAA 3' (SEQ ID No. 1 1 ); penton 
2: 5' TAT CGT TCA TGT CCT CCA CCG GCT GCA TTG CCG CGG CTG CCG 
CGT TGG AGT TCT CTT CC 3' (SEQ ID No. 12); penton 3: 5' CGA TAG CCG 

25 CGG CTA CCC CTA CGA CGT GCC CGA CTA CGC GGG CAC CAG CGC CAC 

ACG GGC TGA GGA GAA GCG CGC 3' (SEQ ID No. 13); penton 4: 5' TCA GCG 
CGC TTC TCC TCA GCC CGT GTG GCG CTG GTG CCC GCG TAG TCG GGC 
ACG TCG TAG GGG TAG CCG CGG C 3' (SEQ ID No. 14). The complementary 
oligonucleotides penton 1 and penton 2 were annealed to each other as were 

30 penton 3 and penton 4. The duplex generated by annealing penton 3 and penton 
4 encoded the substitution of amino acids 337 through 344 described above. 
The duplex generated by annealing penton 1 and penton 2 possessed a 5 base 
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5' overhang which was compatible to a 5 base 5' overhang on the duplex 
generated by annealing penton 3 and penton 4. The opposite end of the duplex 
generated by annealing penton 1 and penton 2 contained an Earl compatible 
overhang. The opposite end of the duplex generated by annealing penton 3 and 
5 penton 4 contained a BbvCI compatible overhang. The two duplexes were 

ligated to each other and ligated back into the pGEMpen5 backbone as follows. 
First, pGEMpen5 was digested with BbvCI and Pstl and the resulting DNA 
fragments were separated by electrophoresis on an agarose gel. The 3360 bp 
fragment was excised from the gel and purified. The plasmid pGEMpen5 was 
10 also digested with Pstl and Earl and the resulting fragments were separated by 
electrophoresis on an agarose gel. The 955 bp fragment was excised from the 
gel and purified. .These two fragments from the pGEMpen5 plasmid were ligated 
with the two pairs of annealed oligonucleotides to generate the plasmid 
pGEMpen5PD1 . 

15 The mutated penton gene was transferred from pGEMpen5PD1 to pSQ1 

using a 5-way ligation as follows. First, the region of the penton gene 
containing the PD1 mutation was excised from P GEMpen5PD1 by digestion with 
Pvul and Ascl. The 974 bp fragment containing the PD1 mutation was purified. 
• Four DNA fragments were prepared from the pSQ1 plasmid (Figure 3B) as 

20 follows. The plasmid was digested with Csp45l and Fsel and the 9465 bp 

fragment was purified. In addition pSQ1 was digested with Fsel and Pvul and 
the 2126 bp fragment was purified. The plasmid pSQ1 was digested with Ascl 
and BamHI and the 5891 bp fragment was purified. Finally, pSQ1 was digested 
with BamHI and Csp45l and the 14610 bp fragment was purified. The 5 purified 

25 DNA fragments were ligated to each other to form the plasmid pSQ1 PD1 (Figure 
4). 

To generate adenoviral vector, pSQ1PD1 was linearized by digestion with 
Clal and co-transfected into PerC6 cells with pAdmireRSVnBg (Figure 3A) which 
had been digested with Sail and Pact, hexadimethrine bromide was maintained 
30 in the medium at 4 A/g/ml. When a cytopathic effect was observed, a crude viral 
lysate was further expanded on PerC6 cells. The virus was purified by standard 
CsCI centrifugation procedures. 



WO 03/062400 



PCT/US03/02295 



-59- 

AvInBgFKOI PD1 

Genetic incorporation of the fiber KOI or KOI 2 mutation in combination 
with the penton PD1 mutation to generate Av1 nBgFK0 1 PD1 

The adenoviral vectors Av1 nBgFKOI PD1 and Av1 nBgKOl 2PD1 were 

generated in an E1-, E3-deleted adenovirus serotype 5 genome. Both vectors 

contains a RSV promoted nuclear-localizing yff-galactosidase gene in the E1 region 

and also contains either the KOI or KOI 2 mutation in the fiber gene as well as 

the PD1 mutation in the penton gene. The vectors were constructed as follows. 

First, the plasmid pSQ1PD1 was digested with Csp45l and Spel and the 23976 

bp fragment containing the PD1 mutated penton gene was purified. In addition, 

the plasmids pSQIKOI or pSQ1K012 (Figure 3B) were digested with Csp45l 

and Spel and the 9090 bp fragment containing the K01 or K012 mutated fiber 

gene were purified. The appropriate purified fragments were ligated to each 

other to from the plasmid pSQ1FK01PD1 (Figure 5A) or pSQ1KO!2PD1 (Figure 

5B) that contains the KOI (or K012) mutated fiber gene and the PD1 mutated 

penton gene. To generate virus, pSQl FKOI PD1 orpSQKO!2PD1 was linearized 

with Clal and co-transfected into 633 cells with pAdmireRSVnBg (Figure 3A) 

which had been digested with Sail and Pad. After three rounds of amplification 

in 633 cells a cytopathic effect was observed and the crude viral lysate was 

then amplified on PerC6 cells. Hexadimethrine bromide was maintained in the 

medium at 4 //g/ml. Each virus was purified by standard CsCI centrifugation 

procedures. 

EXAMPLE 2 

In Vitro Evaluation of Adenoviral Vectors Containing the KOI and PD1 Mutations 

Several recombinant adenoviral vectors were used in these studies to 
demonstrate the function of the KOI fiber mutation and included AvInBg, 
Av1 nBgFKOI , Av1nBgPD1, and Av1 nBgFKOI PD1 , described above. The 
transduction efficiencies of adenoviral vectors containing the KOI and/or PD1 
mutations were evaluated on cells of the alveolar epithelial cell line A549. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber and penton. 
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The day prior to infection, cells were seeded into 24-well plates at a 
density of approximately 1 x 10 5 cells per well. Immediately prior to infection, 
the exact number of cells per well was determined by counting a representative 
well of cells. Each of the vectors, AvInBg, Av1 nBgFKOI , and Av1 nBgFKOI PD1 
5 were used to transduce A549 cells at each of the following particle per cell 
(PPC) ratios: 100, 500, 1000, 2500, 5000, 10,000. The cell monolayers were 
stained with X-gal 24 hours after infection and the percentage of cells 
expressing £-galactosidase was determined by microscopic observation and 
counting of cells. Transductions were done in triplicate and three random fields 
10 in each well were counted, for a total of nine fields per vector. 

The results at the 500 PPC ratio are shown in Figure 6 and show a 
significantly reduced transduction efficiency on A549 cells using vectors 
containing the KOI mutation alone or when combined with PD1 compared to 
AvInBg. The vectors containing the PD1 mutation alone had no effect on 
15 adenoviral transduction of A549 cells in vitro. 

EXAMPLE 3 

In Vivo Analysis of Adenoviral Vectors Containing the FK01 and PD1 Mutations 

This Example provides experiments that evaluate the in vivo 
biodistribution of adenoviral vectors containing the KOI and PD1 mutations and 

20 their influence on adenoviral-mediated liver transduction. The results show that 
ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

A positive control cohort received AvInBg and a negative control group 
received HBSS. Additionally, the Av1 nBgFKOI 2 and Av1 nBgFKOI 2PD1 vectors 

25 were analyzed in vivo. These vectors each contain a fiber protein with the four 
amino acid substitution in the AB loop. Additionally, Av1 nBgFKOI 2PD1 
contains a mutation in the penton base. Both of these mutations were known 
(see, Einfeld et al. (2001 ) J. Virology 75:1 1 284-1 1 291 ), and were alleged to 
decrease liver transduction 10 to 700 fold, respectively. Cohorts of five 

30 C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 13 

particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 



WO 03/062400 PCT7US03/02295 



-61- 

kidney were collected from each animal. The median lobe of the liver was placed 
in neutral buffered formalin to preserve the sample for /?-galactosidase 
immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon PCR analysis to determine vector content. A separate 
5 sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent ^-galactosidase activity assay. 

For yS-galactosidase immunohistochemistry slices of liver, approximately 
2-3 mm thick, were placed in 10% neutral buffered formalin. After fixation, 
these samples were embedded in paraffin, sectioned, and analyzed by 

10 immunohistochemistry for /?-galactosidase expression. A 1 :1 200 dilution was 
used of a rabbit anti-£-galactosidase antibody (ICN Pharmaceuticals, Inc.; Costa 
Mesa, CA) in conjunction with a Vectastain ABC kit (Vector Laboratories, Inc., 
Burlingame, CA) to visualize positive cells. 

The chemiluminescent /?-galactosidase activity assay was performed 

15 using the Galacto-Light PlusTM chemiluminescent assay (Tropix, Inc., Foster 
City, CA) system. Tissue samples were collected in lysis matrix tubes 
containing two ceramic spheres (Bio101, Carlsbad, CA) and frozen on dry ice. 
The tissues were thawed and 500 jj) of lysis buffer from the Galacto-Light Plus 
kit was added to each tube. The tissue was homogenized for 30 seconds using a 

20 FastPrep System (Bio101, Carlsbad, CA). Liver samples were homogenized for 
an additional 30 seconds, jff-galactosidase activity was determined in the liver 
homogenates according to the manufacture's protocol. 

For hexon PCR analysis DNA from tissues was isolated using the Qiagen 
Blood and Cell Culture DNA Midi or Mini Kits (Qiagen Ire, Chatsworth, CA). 

25 Frozen tissues were partially thawed and minced using sterile disposable 

scalpels. Tissues were then lysed by incubation overnight at 55° C in Qiagen 
buffer G2 containing 0.2 mg/ml RNaseA and 0.1 mg/ml protease. Lysates were 
vortexed briefly and then applied to Qiagen-tip 100 or Qiagen-tip 25 columns. 
Columns were washed and DNAs were eluted as described in the manufacturer's 

30 instructions. After precipitation, DNAs were dissolved in water and the 

concentrations were spectrophotometrically determined (A260 and A280) on a 



WO 03/062400 



PCT/US03/02295 



-62- 

DU-600 (Beckman Coulter, Inc.; Fullerton, CA) or a SPECTRAmax PLUS 
(Molecular Devices, Inc.; Sunnyvale, CA) spectrophotometer. 2.3.2. 

PCR primers and a Taqman probe specific to adenovirus hexon sequences 
were designed using Primer Express software v. 1 .0 (Applied Biosystems, Foster 
5 City, CA). Primer and probe sequences were: 

Hexon Forward primer: 5'-CTTCGATGATGCCGCAGTG-3' (SEQ ID No. 38); 
Hexon Reverse primer: 5'-GGGCTCAGGTACTCCGAGG-3' (SEQ ID No. 39); and 
Hexon Probe: 5'-FAM-TTACATGCACATCTCGGGCCAGGAC-TAMRA-3' (SEQ ID No. 
40). 

10 Amplification was performed in a reaction volume of 50 (j\ under the 

following conditions: 10 ng (tumor) or 1 /jg (liver and lung) of sample DNA, 1X 
Taqman Universal PCR Master Mix (Applied Biosystems), 600 nM forward 
primer, 900 nM reverse primer and 100 nM hexon probe. Thermal cycling 
conditions were: 2 minute incubation at 50° C, 10 minutes at 95° C, followed by 
5 35 cycles of successive incubation at 95° C for 1 5 seconds and 60° C for 1 
minute. Data was collected and analyzed using the 7700 Sequence Detection 
System software v. 1.6.3 (Applied Biosystems). Quantification of adenovirus 
copy number was performed using a standard curve that includes dilutions of 
adenovirus DNA from 1,500,000 copies to 15 copies in the appropriate 
20 background of cellular genomic DNA. For analysis of tumor tissues, a standard 
curve in a background of 10 ng human DNA was generated. For analysis of 
mouse liver and lung tissues, a standard curve using the same adenovirus DNA 
dilutions in a background of 1 fjq CD-1 mouse genomic DNA was generated. 
Samples were amplified in triplicate, and the average number of total copies was 
25 normalized to copies per cell based on the input DNA weight amount and a 
genome size of 6 x 10 9 bp. 

The results of the £-galactosidase activity assay and adenoviral hexon 
DNA content for liver transduction by these vectors are shown in Figure 7A and 
7B. The vector containing the K01 or K012 mutations alone showed, on 
30 average, a slight increase in liver transduction compared to AvInBg, which is 
consistent with several previous experiments. The vectors containing the PD1 
mutation alone or combined with KOI or K012 showed a slight decrease in liver 
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transduction compared to AvInBg, suggesting that integrins are involved to 
some extent in hepatic uptake of the adenoviral vectors. 

The results of the immunohistochemical staining of liver sections for 
/?-ga!actosidase were consistent with the activity assays (data not shown) and 
demonstrate that gene expression was localized specifically to hepatocytes. The 
vectors containing the KOI or K012 mutation alone showed a slight increase in 
liver transduction as revealed by a more intense and frequent 
immunohistochemical-staining pattern. The vectors containing the PD1 
mutation, either alone or combined with KOI or K012, showed little difference 
in transduction compared to AvInBg. These results demonstrate that 
ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

In summary, the fiber AB loop mutation contained in Avl nBgFKOI or 
Av1nBgK012 ablates interaction with human and mouse CAR in vitro and 
diminished transduction in vitro. In vivo, however, fiber AB loop mutations 
behaved unexpectantly, because such mutations were found to enhance 
adenoviral-mediated gene transfer to liver and results in increasing vector 
potency. The penton base, PD1 mutation that ablates interaction with the 
second receptor involved in adenoviral internalization had no effect in vitro and 
little to no effect in vivo. These studies indicated that other receptors are 
responsible for adenoviral gene transfer to the liver in vivo. 

EXAMPLE 4 

Description Of Adenoviral Vectors Containing A Fiber With Amino Acid 
Substitutions At The Heparin Sulfate Binding Domain In The Fiber Shaft 

Vectors containing substitutions at all four of the amino acids in the four 

amino acid motif in the Ad5 fiber shaft (residues 91 to 94, KKTK; SEQ ID No. 1) 

were generated in order to ablate the potential interaction with HSP. The 

mutation is termed HSP because it potentially eliminates binding to heparan 

sulfate proteoglycans. Vectors containing the HSP mutation alone and combined 

with the KOI mutation (fiber knob AB loop mutation that ablates CAR binding), 

the PD1 mutation (penton mutation that eliminates RGD/integrin interaction), and 

a triple knockout vector (HSP, KOI, PD1) were generated. 
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Generation of the HSP fiber mutation: The HSP mutation was 
incorporated into the fiber gene by using a PCR-based strategy of gene splicing 
by overlap extension (PCR SOEing). First, a segment of the Ad5 genome 
extending from within the E3 region into the 5' end of the fiber gene was 
5 amplified by PCR using the plasmid pSQ1 (Figure 3B) as a template and two 
primers termed 5FF and 5HSPR. The DNA sequence of 5FF is as follows: 5' 
GAA CAG GAG GTG AGC TTA GA 3' (SEQ ID No. 5). This sequence 
corresponds to base pairs 25,199 - 25,218 of pSQ1 . The DNA sequence of 
5HSPR is as follows: 5' GGC TCC GGC TCC GAG AGG TGG GCT CAC AGT 

10 GGT TAC ATT T 3' (SEQ ID No. 15). 5HSPR is a reverse primer for 5FF and 
corresponds to a region in the fiber shaft adjacent to the KKT-K (SEQ ID No. 1) 
region. The primer contains a 5' extension that encodes a GAGA substitution 
for the native KKTK (encoded by SEQ ID No. 1) amino acid sequence. A second 
PCR using pSQ1 as a template amplified the region immediately 3' of the KKTK 

15 (SEQ ID No. 1) site and extending past the Muni site located 40 base pairs 3' of 
the stop codon for the fiber gene. The two primers used for this reaction were 
3HSPF and 3FR. The DNA sequence of 3HSPF is as follows: 5' GGA GCC GGA 
GCC TCA AAC ATA AAC CTG GAA AT 3' (SEQ ID No. 1 6). It contains a 5' 
extension that is complementary to the 5' extension of 5HSPR. The DNA 

20 sequence of 3FR is as follows: 5' GTG GCA GGT TGA ATA CTA GG 3' (SEQ ID 
No. 8). 

The two PCR products were joined by PCR SOEing using primers 5FF and 
3FR. The resulting PCR product was digested with the restriction enzymes Xbal 
and Muni. The 2355 bp fragment was gel purified and ligated with the 6477 bp 

25 Xbal to Muni fragment of the plasmid pFBshuttle(EcoRI) (Figure 8) to generate 
the plasmid pFBSEHSP. The plasmid pFBshuttle(EcoRI) was generated by 
digesting the plasmid pSQ1 with EcoRI, then gel purifying and self-ligating the 
8.8 kb fragment containing the fiber gene. Next, the fiber gene containing the 
HSP mutation was transferred from pFBSEHSP into pSQ1 using a three-way 

30 ligation. The 1 6,431 bp EcoRI to Ndel fragment of pSQ1 , the 9043 bp Nde! to 
Xbal fragment of pSQ1 , and the 7571 bp Xbal to EcoRI fragment of pFBSEHSP 
were isolated and ligated to generate pSQIHSP (Figure 9). 
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To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene, pSQIHSP was digested with Clal and pAdmireRSVnBg (Figure 
3A) was digested with Sail and Pad, then the two digested plasmids were 
co-transfected into 633 cells (von Seggern et al. (2000) J Virology 74:354-362). 
5 Homologous recombination between the two plasmids generated a full-length 
adenoviral genome capable of replication in 633 cells, which inducibly express 
Ad5 E1A and constitutively express wild-type fiber protein. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation, 

10 the viral preparation was used to infect PerC6 cells, which do not express fiber. 
The resulting virus, termed Av1 nBgFS*, was purified by standard CsCI 
centrifugation procedures. 

Generation of vector containing the HSP and KOI mutations 

To generate an adenoviral vector containing the HSP and K01 mutations 

15 in fiber, a PCR SOEing strategy identical to the one described above was used 
except that the plasmid pSQIFKOI was used as the template. The PCR SOEing 
product was digested with Xbal and Muni and ligated with the 6477 bp Xbal to 
Muni fragment of pFBshuttle(EcoRI) to generate pFBSEHSPKOI . The fiber gene 
containing the HSP and K01 mutations was transferred from pFBSEHSPKOI into 

20 the pSQ1 backbone using a three-way ligation strategy identical to the one 
described above for the HSP mutation alone, to generate the plasmid 
pSQIHSPKOI (Figure 10). Recombinant adenoviral vector containing the HSP 
and KOI mutations in the fiber gene was generated by co-transfecting 
pSQIHSPKOI digested with Clal and pAdmireRSVnBg digested with Sail and 

25 Pad into 633 cells. Adenovirus was propagated and purified as described above 
for the vector containing the HSP mutation alone. The resulting virus was 
termed Av1 nBgFKOI S*. 
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Generation of vector containing the HSP and PD1 mutations 

The following strategy was used to generate a recombinant adenoviral 
vector containing the fiber HSP mutation and the penton PD1 mutation. The 
plasmid pSQ1PD1 (Figure 4) was digested with the restriction enzymes Csp45l 
and Spel and the 23,976 bp fragment was isolated and purified. In addition, the 
plasmid pSQIHSP was also digested with Csp45l and Spel and the 9090 bp 
fragment was isolated and purified and ligated to the 23,976 bp fragment to 
generate the plasmid pSQ1HSPPD1 (Figure 11), which contains the fiber HSP 
and penton PD1 mutations. An adenoviral vector was generated, propagated, 
and purified as described above. The resulting virus was termed Av1nBgS*PD1. 
Generation of vector containing the HSP, KOI, and PD1 mutations 
To generate an adenoviral vector containing the HSP, K01, and PD1 
mutations the following strategy was used. First, the plasmid pSQ1PD1 was 
digested with Csp45l and Spel and the 23,976 bp fragment was isolated and 
purified. In addition, the plasmid pSQIHSPKOI was digested with Csp45l and 
Spel and the 9090 bp fragment was isolated and purified. The two DNA 
fragments were ligated to form the plasmid pSQ1 HSPKOI PD1 (Figure 12). 
Recombinant adenoviral vector was generated, propagated, and purified as 
described above. The resulting virus was termed Av1 nBgFKOI S*PD1 . 

EXAMPLE 5 

In Vitro Evaluation Of Adenoviral Vectors Containing The HSP Fiber Mutation 

The transduction efficiencies of adenoviral vectors containing the HSP 
mutation in the fiber gene, either alone or combined with the KOI and/or PD1 
mutations, were evaluated on A549 and HeLa cells. The transduction 
efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber and penton. The day prior to infection, cells were seeded into 
24-well plates at a density of approximately 1 x 10 s cells per well. Immediately 
prior to infection, the exact number of cells per well was determined by counting 
a representative well of cells. Each of the vectors, AvInBg (see, Stevenson et 
at. (1997) J. Virol. 77:4782-4790), Av1 nBgS*. Av1 nBgFKOI S*, 
Av1nBgS*PD1, and Av1 nBgFKOI S*PD1 , were used to transduce A549 cells at 
each of the following particle per cell (PPC) ratios: 100, 500, 1000, 2500, 
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5000, 10,000. HeLa cells were transduced with each of the above vectors, as 
well as a vector containing the KOI mutation alone (AvInBgFKOD and a vector 
containing the PD1 mutation alone (Av1 nBgPDl ) at 2000 PPC. The cell 
monolayers were stained with X-gal 24 hours after infection and the percentage 
of cells expressing £-galactosidase was determined by microscopic observation 
and counting of cells. Transductions were done in triplicate and three random 
fields in each well were counted, for a total of nine fields per vector. 

The results (depicted in Figures 13A-13B) showed significantly reduced 
transduction efficiencies on A549 and HeLa cells using vectors containing the 
HSP mutation compared to AvInBg. The vectors containing the HSP mutations, 
however, demonstrated a dose response on A549 cells, in that increasing PPC 
ratios yielded increasing transduction. 

Competition experiments were done to determine which receptor molecular 
interactions are involved in transduction of A549 cells by the various vectors. 
Transductions were performed in the presence or absence of various competitors 
including Ad5 fiber knob, a 50 amino acid oligopeptide derived from Adenovirus 
serotype 2 penton base which spans the RGD tripeptide region, or heparin 
(Invitrogen Life Technologies, Gaithersburg, MD). Monolayers of A549 cells 
were cultured in Richters medium supplemented with 10% FBS and were 
transduced with AvInBg, AvInBgS*, Av1 nBgFKOI S*, Av1nBgS*PD1, or 
AvI nBgFKOI S*PD1 in infection medium (IM, Richters medium plus 2% FBS). 
Different PPC ratios were used for the different vectors to achieve measurable 
transduction levels. The PPC ratios were as follows: AvInBg: 500 PPC, 
AvInBgS*: 10,000 PPC, Av1 nBgFKOI S*: 20,000 PPC, Av1 nBgS*PP1 : 10,000 
PPC, and Av1 nBgFKOI S*PD1: 20,000 PPC. Fiber knob competition was 
performed by pre-incubating cells in IM containing 1 6 jt/g/ml of fiber knob for 10 
minutes at room temperature prior to infection with virus. Penton base peptide 
competition was performed by pre-incubating cells in IM containing 500nM 
peptide for 10 minutes at room temperature prior to infection with virus. 
Heparin competition was performed by pre-incubating each adenoviral vector in 
IM containing 3 mg/ml of heparin for 20 minutes at room temperature. In all 
cases, the competitor remained in the IM during the 1 hour infection when virus 
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was rocked on the cell monolayers at 37° C in 5% C02. After infection, the 
monolayers were washed with PBS, 1 ml of complete medium was added per 
well and the cells were incubated for an additional 24 hours to allow for 
yff-galactosidase expression. The cell monolayers were then fixed and stained 
5 with X-Gal. The percentage of cells transduced was determined by light 

microscopy as described above. Each condition was carried out in triplicate and 
three random fields per well were counted, for a total of nine fields per 
condition. The average percentage of transduction per high-power field was 
determined. 

10 The results of the competition experiment (Figure 1 3C) showed that fiber 

knob inhibited transduction of cells by all vectors except for those that contained 
the KOI mutation. The penton base peptide only inhibited transduction by 
Av1 nBgFKOI S*. Heparin inhibited transduction by AvInBgFKOIS* and 
Av1nBgFK01S*PD1 , but did not affect transduction by any of the other viruses 

1 5 suggesting the presence of additional heparin binding sites on the adenoviral 
capsid but that the shaft contains the predominant site. 

EXAMPLE 6 

In Vivo Analysis Of Adenoviral Vectors Containing The HSP Mutation In Fiber 

The objective of this study was to evaluate the in vivo biodistribution of 
20 adenoviral vectors containing the HSP mutation and to determine whether this 
shaft modification influences adenoviral-mediated liver transduction. In addition, 
vectors containing the HSP mutation combined with KOI, or PD1, or a 
combination of all three mutations were evaluated as well as vectors containing 
the K01 mutation alone and the PD1 mutation alone. A positive control cohort 
25 received AvInBg and a negative control group received HBSS. Cohorts of five 
C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 13 
particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 
kidney were collected from each animal. The median lobe of the liver was placed 
30 in neutral buffered formalin to preserve the sample for /?-galactosidase 

immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon real time PCR analysis to determine vector content. A 
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separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent yff-galactosidase activity assay. /?-galactosidase 
immunohistochemistry, hexon real-time PCR and the chemiluminescent 
y?-galactosidase activity assay were carried out as described in Example 3. 
5 The results of the £-galactosidase activity assay (Figure 14A) and adenoviral 
hexon DNA content (Figure 14B) showed a dramatic reduction in liver 
transduction by vectors containing the HSP mutation. The vectors containing 
the HSP mutation alone resulted in reducing adenoviral-mediated liver gene 
expression by approximately 20-fold. When combined with the KOI mutation 

10 (HSP, KOI, PD1), yielded approximately a 1000-fold reduction in y?-galactosidase 
activity in the liver compared to the control vector Av1 nBg. The vector 
containing the KOI mutation alone showed a slight increase, on average, in liver 
transduction compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined with 

15 KOI showed a slight decrease in liver transduction compared to AvInBg, 
although the decrease was not statistically significant. Analysis of hepatic 
adenoviral hexon DNA content (Figure 14B) confirmed these results. 

The results of the immunohistochemical staining of liver sections f or /?- 
galactosidase were consistent with the activity assays (data not shown) and 

20 demonstrated that gene expression was localized specifically to hepatocytes. 
Vectors containing the HSP mutation, either alone or in combination with KOI 
and/or PD1, showed a dramatic reduction in hepatocyte transduction. The 
vector containing the KOI mutation alone showed a slight increase in liver 
transduction as revealed by a more intense and frequent immunohistochrmical 

25 staining pattern. The vectors containing the PD1 mutation, either alone or 
combined with KOI, showed little difference in transduction compared to 
AvInBg. 

EXAMPLE 7 

Description of Adenoviral Vectors Containing the HSP Fiber Shaft Mutation with 
30 and without the KOI Fiber Mutation and with and without a cRGD Targeting 
Ligand in the Fiber Knob HI Loop 
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Generation of vector containing the HSP fiber shaft mutation and a cRGD ligand 

in the HI loop: The following strategy was used to generate an adenoviral vector 

containing a fiber with the HSP shaft mutation and a cRGD ligand in the HI loop. 

The plasmid p5FloxHRFRGD was digested with the restriction enzymes BstXI 

5 and Kpnl and the 1157 bp fragment was isolated and purified. In addition, the 

fiber shuttle plasmid pFBSEHSP, described in Example 1 above, was digested 

with BstXI and Kpnl and the 4549 bp and 3156 bp fragments were isolated and 

purified. The three fragments were ligated to generate the plasmid 

pFBSEHSPRGD, which encodes a fiber containing the HSP mutation and cRGD in 

10 the HI loop. The fiber gene from this plasmid was transferred into the pSQI 

backbone as follows. The plasmid pFBSEHSPRGD was digested with EcoRI and 

Xbal and the 7601 bp fragment was isolated and purified. The plasmid pSQ1 

(Figure 3B) was digested with the restriction enzymes EcoRI, Ndel, and Xbal and 

the 16,431 bp EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment 

15 were isolated and purified. The three DNA fragments were ligated to generate 

the plasmid pSQIHSPRGD (Figure 15A). 

To generate a recombinant adenoviral vector containing the HSP mutation 

in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 

pSQIHSPRGD was digested with Clal and co-transfected into 633 cells with 

20 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 

on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 

obtain viral particles containing only the modified fiber with the HSP mutation 

and a cRGD ligand, the viral preparation was used to infect PerC6 cells, which 

do not express fiber. The resulting virus, termed Av1 nBgS*RGD, was purified 

25 by standard CsCI centrifugation procedures. 

Generation of vector containing the HSP fiber shaft mutation, the K01 fiber knob 
mutation, and a cRGD ligand in the HI loop 

The following strategy was used to generate an adenoviral vector 

containing a fiber with the HSP shaft mutation, the KOI fiber knob mutation, 

30 and a cRGD ligand in the HI loop. The plasmid p5FloxHRFRGD was digested 

with the restriction enzymes BstXI and Kpnl and the 1 1 57 bp fragment was 

isolated and purified. In addition, the fiber shuttle plasmid pFBSEHSPKOI, 



' WO 03/062400 PCT/US03/02295 



-71- 

described in Example 1 above, was digested with BstXI and Kpnl and the 4549 
bp and 3156 bp fragments were isolated and purified. The three fragments were 
ligated to generate the plasmid pFBSEHSPKOI RGD, which encodes a fiber 
containing the HSP mutation, the KOI mutation, and cRGD in the HI loop. The 
5 fiber gene from this plasmid was transferred into the pSQ1 backbone as follows. 
The plasmid pFBSEHSPKPOl RGD was digested with EcoRI and Xbal and the 
7601 bp fragment was isolated and purified. The plasmid pSQ1 (Figure 3B) was 
digested with the restriction enzymes EcoRI, Ndel, and Xbal and the 16,431 bp 
EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment were isolated 

10 and purified. The three DNA fragments were ligated to generate the plasmid 
pSQIHSPKOIRGD (Figure 15B). 

To generate a recombinant adenoviral vector containing the HSP and K01 
mutations in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQIHSPKOIRGD was digested with Clal and co-transfected into 633 cells with 

15 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP and KOI 
mutations and a cRGD ligand, the viral preparation was used to infect PerC6 
cells, which do not express fiber. The resulting virus, termed 

20 Av1nBgFKO!S*RGD, was purified by standard CsCI centrifugation procedures. 

EXAMPLE 8 

In Vitro Evaluation of Adenoviral Vectors Containing the HSP Fiber Shaft 
Mutation with or without the Fiber Knob KOI Mutation and with or without a 
cRGD Ligand in the HI Loop 

25 The transduction efficiencies of adenoviral vectors containing the HSP ' 

fiber shaft mutation with or without the fiber KOI mutation and with or without 
the cRGD ligand in the HI loop were evaluated on A549 cells. The transduction 
efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber. The day prior to infection, cells were seeded into 24-well plates 

30 at a density of approximately 1 x 10 5 cells per well. Immediately prior to 
infection, the exact number of cells per well was determined by counting a 
representative well of cells. Each of the vectors, AvInBg, AvInBgS*, 
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Av1 nBgFKOI S*, Av1 nBgS*RGD, and Avl nBgFKOI S*RGD, were used to 
transduce A549 cells at a particle to cell ratio of 6250. The cell monolayers 
were stained with X-gal 24 hours after infection and the percentage of cells 
expressing £-galactosidase was determined by microscopic observation and 
5 counting of cells. Transductions were done in triplicate and three random fields 
in each well were counted, for a total of nine fields per vector. The results 
(Figure 1 6) showed that the cRGD ligand dramatically increased the transduction 
efficiencies of vectors containing the HSP mutation alone or combined with the 
KOI mutation. AvInBgS* yielded approximately 22% positive cells, while 
10 AvlnBgS*RGD yielded approximately 95% positive cells. Similarly, 

Av1 nBgFKOI S* yielded only 4% positive cells, while Av1 nBgFKOI S*RGD 
yielded 85% positive cells. Therefore, the vector containing the shaft mutation 
is viable and can be retargeted with the addition of a ligand. 

EXAMPLE 9 

15 Construction Of Ad5 Vectors Containing The Ad35 Fiber And Derivatives 
Thereof 

The KOI and HSP mutations in the Ad5 fiber protein (5F), described 
above, were designed to ablate interactions that are responsible for the normal 
tropism of the Ad5 virus. An alternative strategy to detarget the virus is to 

20 replace the Ad5 fiber with a fiber from another serotype which does not bind 

CAR and which does not possess the heparin sulfate proteoglycan (HSP) binding 
domain (KKTK; SEQ ID No. 1) within the shaft. The fiber of adenovirus serotype 
35 (35F) does not bind CAR and does not possess the HSP binding domain in its 
shaft. Replacement of the 5F with the 35F can detarget the liver and provide a 

25 suitable platform for retargeting the vector to the desired tissue. 

Generation of an Ad5 based vector containing the Ad35 fiber: A PCR 
SOEing strategy was used to generate a vector based on the Ad5 serotype but 
containing the Ad35 fiber in place of the Ad5 fiber. First, PCR was used to 
amplify a region in the plasmid pSQ1 between the Xbal site at bp 25,309 and 

30 the start of the fiber gene. The primers used for this reaction were P-0O05/U 
and P-0O06/L. The DNA sequence of P-O005/U was as follows: 5' C TCT AGA 
AAT GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 17). This sequence 
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corresponds to bp 25,308 through 25,334 of pSQ1 . The DNA sequence of 
P-0006/L was as follows: 5' TCT TGG TCA TCT GCA ACA AC A TGA AG A TAG 
TG 3' (SEQ ID No. 18). It contains a 10 base pair 5' extension that is 
complementary to the start of the Ad35 fiber gene, while the remainder of the 
5 primer anneals to the sequence immediately 5' of the ATG start codon of the 
fiber gene in pSQ1 . A PCR product of the expected size, 583 bp, was obtained 
and the DNA was gel purified. A second PCR amplified the Ad35 fiber gene 
using DNA extracted from wildtype Ad35 virus as a template. The primers used 
for this reaction were P-0007/U and 35FMun. The DNA sequence of P-0007/U 

10 ; was as follows: 5' GT TGT TGC AG ATG ACC AAG AGA GTC CGG CTC A 3/ 
(SEQ ID No. 19). It contains a 10 base pair 5' extension that is homologous to 
the 10 bp immediately prior to the ATG start codon of the fiber gene in Ad5. 
The remainder of the primer anneals to the start of the Ad35 fiber gene. The 
DNA sequence of 35FMun was as follows: 5' AG CAA TTG AAA AAT AAA 

15 CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT A GTT GTC GTC TTC TGT 
AAT GTA AGA A 3' (SEQ ID No. 20). It contains a 46 base pair 5' extension 
that is complementary to the region of the Ad5 genome between the end of fiber 
and the Muni site 40 bp downstream of the fiber gene. In addition, the 5' 
extension encodes the last amino acid and stop codon of the Ad5 fiber gene. 

20 This region was retained in the vector because it contains the polyadenylation 
site for the fiber gene. The remainder of the primer anneals to the 3' end of the 
Ad35 fiber gene, up to the next to last amino acid codon. A PCR product of the 
expected size, 1027 bp, was obtained and the DNA was gel purified. The two 
PCR products were mixed and joined together by PCR SOEing using primers 

25 P-0005/U and P-0009. The DNA sequence of P-0009 was as follows: 5' AG 

CAA TTG AAA AAT AAA CAC GTT G 3' (SEQ ID No. 21). It corresponds to bp 
27,648 through 27,669 of pSQ1 and overlaps the Muni site in that region. A 
PCR product of the expected size, 1590 bp, was obtained and gel purified. It 
was cloned into the plasmid pCR4blunt-TOPO (Invitrogen Corporation, Carlsbad 

30 CA) using the Zero Blunt TOPO PCR Cloning Kit from Invitrogen. This 

intermediate cloning step simplified DNA sequencing of the PCR SOEing product. 
The resulting plasmid, termed pTOPOAd35F, was digested with Xbal and Muni 
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and the 1585 bp digestion product was gel purified and ligated with the 6477 bp 
fragment of pFBshuttle(EcoRI) digested with Xbal and Muni to generate the 
plasmid pFBshuttleAd35F. The Ad35 fiber gene was transferred from 
pFBshuttleAd35F into pSQ1 as follows. The plasmid pSQ1 was digested with 
5 EcoRI and the 24,213 bp fragment was gel purified. The plasmid 

pFBshuttleAd35F was linearized with EcoRI and ligated with the 24,213 bp 
fragment from pSQ1 . Restriction diagnostics were performed to screen for 
constructs containing the Ad35 fiber gene inserted into the pSQ1 backbone in 
the correct orientation. The pSQ1 plasmid containing the Ad35 fiber gene in the 

10 proper orientation was termed pSQ1 Ad35Fiber (Figure 17A). To generate 

adenoviral vector containing the Ad35 fiber, pSQ1 Ad35Fiber was digested with 
Clal and co-transfected into 633 cells with pAdmireRSVnBg which had been 
digested with Sail and Pad. After propagation on 633 cells, the resulting virus 
contained Ad5 fiber and Ad35 fibers on its capsid. The virus was amplified on 

15 PerC6 cells to generate virus containing only the Ad35 fiber on its capsid. The 
resulting virus preparation was termed Av1nBg35F. 

Construction of adenoviral vectors containing chimeric fibers derived from 
Ad5 and Ad35: Two chimeric fiber constructs were prepared by PCR gene 
overlap extension using plasmids containing the full length Ad5 or Ad35 fiber 

20 cDNAs as templates. The Ad5 fiber tail and shaft regions (5TS; amino acids 1 
to 403) were connected with the Ad35 fiber head region (35H; amino acids 137 
to 323) to form the 5TS35H chimera, and the Ad35 fiber tail and shaft regions 
(35TS; amino acids 1 to 136) were connected with the Ad5 fiber head region 
(5H; amino acids 404 to 581 ) :o form the 35TS5H chimera. The fusions were 

25 made at the conserved TLWT sequence at the fiber shaft-head junction. 

For the construction of the 5TS35H chimera, the pFBshuttle(EcoR1 ) 
plasmid was used as the template with primers P1 and P2 to generate the 5' 
fragment. The 3' fragment was generated using the pFBshuttleAd35 plasmid as 
the template with the P3 and P4 primers. The sequence of each primer used in 

30 the construction of these chimeric fibers is listed in Table 2. Amplified PCR 
products of the expected size were obtained and were gel purified. A second 
PCR was carried out with the end primers P1 and P4 to join the two fragments 
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together. The DNA fragment generated in the second PCR was digested with 
Xba1 and Muni and was cloned directly into pFBshuttle{EcoR1 ) to create the 
fiber shuttle plasmid pFBshuttle5TS35H. 

TABLE 2 

5 Primers Used For The Exchange Of Fiber Shaft Regions Between Ad5 And Ad35 

Fibers 



Primer 
designation 


Sequence 


SEQ ID 


P1 


5'-GAACAGGAGGTGAGCTTAGA-3' 


22 


P2 


5'-GTTAGGTGGAGGGTTTATTCCGGTCCAC 
A A AGTT AG CTT ATC-3 ' 


23 


P3 


5'-GATAAGCTAACTTTGTGGACCGGAATAAA 
CCCTCCACCTAAC-3' 


24 


P4 


5'-GTGGCAGGTTGAATACTAGG-3 


25 


P5 


5'-GTTAGGAGATGGAGCTGGTGTAGTCCATA 
AGGTGTTAATAC-3' 


26 


P6 


5'-GTATTAACACCTTATGGACTACACCAGCT 
CCATCTCCTAAC-3' 


27 


P7 


5'-TGCGCAAAAACAATCACCACGACAATCACAAT 
GTACATTGGAAGAAATCATACG-3' 


28 


P8 


S'-ACATTGTGATTGTCGTGGTGATT 
GTTTTTGCGCATATGCCATACAATTTGAATG-3' 


29 



For the construction of the 35TS5H chimera, the pFBshuttleAd35 plasmid 
was used as the template with the P1 and P5 primers to generate the 5' 
20 fragment. The 3' fragment was generated using the pFBshuttle(EcoR1 ) plasmid 
as the template with the P6 and P4 primers. Following the same procedure 
described above, the fiber shuttle plasmid pFBshuttle35TS5H was generated. 

For the 35TS5H and 5TS35H chimeras, the fiber gene was transferred 
from the pFBshuttle(EcoRI) backbone into pSQ1 as described above for the 
25 vector containing the Ad35 fiber. The resulting plasmids were called 

pSQ1 35T5H (Figure 18A) and pSQ15T35H (Figure 18B). In addition, adenoviral 
vectors were generated using the co-transfection strategy described above. 

Construction of Ad5 vectors containing the Ad35 fiber with a cRGD 
targeting peptide in the HI loop of the 35F fiber knob: To incorporate the cRGD 
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targeting peptide into the Ad35 fiber HI loop, the P7 and P8 oligonucleotide 
primers encoding the ten amino acid sequence HCDCRGDCFC (SEQ ID No. 30) 
were synthesized. The pFBshuttleAd35 plasmid containing the full length Ad35 
fiber cDNA was used as the template in the PCR reaction with the P1 and P7 
5 primer pair or with the P4 and P8 primer pair in order to generate the 5' and 3' 
PCR fragments. A second PCR was then carried out with the end primers P1 
and P4 to join the two fragments together. The resulting PCR fragment was 
digested with Xba1 and Muni and was cloned into pFBshuttle(EcoR1 ) to create 
the fiber shuttle plasmid pFBshuttleAd35cRGD. The modified Ad35 fiber gene 
10 was transferred into pSQ1 using the EcoRI cloning strategy described above to 
generate pSQ1 Ad35FcRGD (Figure 17B). Adenoviral vector was generated 
using the co-transfection strategy described above. 

EXAMPLE 10 

In Vitro Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
15 Thereof 

The transduction efficiencies of adenoviral vectors containing the 35 F or 
derivatives thereof were evaluated on A549 cells. The transduction efficiencies 
were compared to that of AvInBg, an adenoviral vector containing the 5F fiber. 
The day prior to infection, cells were seeded into 24-well plates at a density of 

20 approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1nBg35F, Av1nBg5T35H and 
Av1nBg35T5H were used to transduce A549 cells from 0 up to 1,000 particle 
per cell (PPC) ratios. The cell monolayers were stained with X-gal 24 hours after 

25 infection and the percentage of cells expressing /?-galactosidase was determined 
by microscopic observation and counting of cells. Transductions were done in 
triplicate and three random fields in each well were counted, for a total of nine 
fields per vector. The results (Figure 1 9) showed similar transduction 
efficiencies on A549 cells using the Av1nBg35F and Av1nBg5T35H vectors 

30 compared to AvInBg. The Av1nBg35T5H showed much lower transduction 
efficiencies on A549 cells compared to AvInBg as a result of the Ad35 shaft 
domain. The Ad35 shaft domain does not contain a HSP binding motif and the 
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Av1nBg35T5H vector behaves similarly to the AvInBgS* vector in vitro and in 
vivo. These studies also demonstrate that vectors containing fiber proteins 
without an HSP binding site are fully viable. 

EXAMPLE 1 1 

in Vivo Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
Thereof 

The objective of this study was to evaluate the in vivo biodistribution of 
adenoviral vectors containing 35F fibers and derivatives thereof to determine 
whether vectors containing these fibers ablate liver transduction due to their 
shaft regions. A positive control cohort received AvInBg and a negative control 
group received HBSS. Cohorts of five C57BL/6 mice received each vector via 
tail vein injection at a dose of 1 x 10 13 particles per kg. The animals were 
sacrificed approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 
animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for £-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 
determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent jff-galactosidase activity assay, fi- 
galactosidase immunohistochemistry, hexon real-time PCR and the 
chemiluminescent /?-ga!actosidase activity assay were carried out as described in 
example 3. 

The results of the jff-ga!actosidase activity assay showed a dramatic 
reduction in liver transduction by vectors containing the Ad35 fiber or the 
35T5H derivative (Figure 20) with an approximately 4- to 24-fold reduction in 
galactosidase activity in the liver compared to the control vector AvInBg. These 
data demonstrate that shaft domains without HSP binding sites can effectively 
ablate hepatic in vivo gene transfer. In particular, HSP is the major entry 
mechanism for liver in vivo. CAR binding is a minor entry pathway. 
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EXAMPLE 12 

Construction Of Ad5 Vectors Containing The Ad Serotype 41 Short Fiber And 
Derivatives Thereof 

The human adenovirus serotype 41 contains two different fibers on its 

5 capsid, encoded by two adjacent genes. One fiber has a molecular weight of 

60kDa and is approximately 315A in length and is termed the long fiber. The 

other fiber has a molecular weight of 40kDa and is approximately 250-J- in 

length and is termed the short fiber. The Ad41 short fiber does not bind CAR 

and does not possess the heparin binding domain (KKTK) in its shaft. Therefore, 

10 this fiber provides a useful platform for adenoviral vector targeting. 

Construction of adenoviral vectors based on Ad5 but containing the Ad41 

short fiber: A PCR SOEing strategy was used to generate a vector based on the 

Ad5 genome but containing the Ad41 short (Ad41s) fiber. First, PCR was used 

to amplify the region of pSQ1 between the Xbal site at bp 25,309 and the start 

15 of the fiber gene. The primer pair used for the PCR were P-0005/U and 

P-0010/L. The DNA sequence of P-0005/U was as follows: 5' C TCT AGA AAT 

GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 17). The sequence corresponds to 

bp 25,308 through 25,334 of pSQ1 and overlaps the Xbal site in that region. 

The DNA sequence of P-0010/L was as follows: 5' TTC TTT TCA T CTG CAA 

20 CAA CAT GAA GAT AGT G 3' (SEQ ID No. 31). It contains a 5' extension 

corresponding to the first 10 bp of the Ad41s fiber gene. The remainder of the 

primer anneals to pSQ1 immediately 5' of the ATG start codon of the fiber gene. 

The PCR product was the expected size (583 bp). A second PCR was used to 

amplify the Ad41s fiber using the plasmid pDV60Ad41sF as a template. The 

25 primers used were P-001 1/U and P-0012/L. The DNA sequence of P-001 1/U 

was as follows: 5' GT TGT TGC AG ATG AAA AGA ACC AGA ATT GAA G 3' 

(SEQ ID No. 32). It contains a 10 bp 5' extension corresponding to the DNA 

sequence immediately 5' of the ATG start codon of the fiber gene in pSQ1 . The 

remainder of the primer anneals to the beginning of the Ad41s fiber gene in 

30 pDV60Ad41sF. The DNA sequence of P-001 2/L was as follows: 5' TG CAA 

TTG AAA AAT AAA CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT ATT C 

TTC AGT TAT GTA GCA AAA TAC A 3' (SEQ ID No. 33). It contains a 51 bp 
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5' extension corresponding to the sequence in pSQ1 from the last codon of the 
fiber gene through the Muni site 40 bp downstream of the fiber gene. The 
remainder of the primer anneals to the 3' end of the Ad41s fiber gene in 
pDV60Ad41sF. The PCR product was the expected size (1219 bp). The two 
5 PCR products were joined by PCR SOEing using primers P-0005/U and P-0O09/L. 
The DNA sequence of P-0009/L was described above. The PCR SOEing reaction 
yielded the expected 1782 bp product. The product was cloned into 
pCR4blunt-T0P0 to yield pCR4blunt-TOPOAd41 sF. Next, 
pCR4blunt-TOPOAd41sF was digested with Xbal and Muni and the 1773 bp 

10 fragment containing the Ad41s fiber gene was gel purified. This fragment was 
ligated with the 6477 bp Xbal to Muni fragment of pFBshuttle(EcoRI) to generate 
pFBshuttleAd41sF. The Ad41s fiber gene was transferred into the pSQ1 
backbone as follows. First, pFBshuttleAd41sF was linearized using EcoRI and 
this fragment was ligated with the 24,213 bp EcoRI fragment of pSQ1 to 

15 generate pSQ1Ad41sF (Figure 21 A). Adenoviral vector containing the Ad41s 
fiber was generated using the co-transfection strategy described above. 

Construction of Ad5 adenoviral vectors containing the Ad41 short fiber 
with a cRGD targeting ligand in the HI loop: A PCR SOEing strategy was used to 
generate a construct containing the Ad41s fiber with cRGD in the HI loop. The 

20 plasmid pFBshuttleAd41 sF was used as a template for the PCR amplifications. 
First, a 1782 bp fragment was amplified using primers 5FF and 41sRGDR. The 
primer 5FF was described above. It anneals to pFBshuttleAd41 sF at the Xbal 
site upstream of the fiber gene. The DNA sequence of the primer 41sRGDR was 
follows: 5' AGT ACA AAA ACA ATC ACC ACG ACA ATC ACA GTT TAT 

25 CTC GTT GTA GAC GAC ACT GA 3' SEQ ID No. 34). It contains a 30 bp 5' 

extension that encodes the cRGD targeting ligand. The remainder of the primer 
anneals to pFBshuttleAd41 sF from bp 2878 through 2903. A second PCR 
amplified a 277bp region of pFBshuttleAd41 sF using primers 3FR and 41 sRGDF. 
The primer 3FR was described previously. It anneals to pFBshuttleAd41 sF at the 

30 Muni site downstream of the fiber gene. The DNA sequence of 41 sRGDF was 
as follows: 5' TGT GAT TGT CGT GGT GAT TGT TTT TGT ACT AGT GGG TAT 
GCT TTT ACT TTT 3' (SEQ ID No. 35). It contains a 30 bp 5' extension that 
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encodes the cRGD targeting ligand and is complementary to the extension on 
41 sRGDR. The remainder of the primer anneals to pFBshuttleAd41 sF from bp 
2904 through 2924. The two PCR products were joined by PCR SOEing to 
generate a 2059 bp fragment using primers 5FF and 3FR. The product was 
5 digested with Xbal and Muni and the 1803 bp DNA fragment was gel purified. 
The fragment was ligated with the 6477 bp fragment resulting from digestion of 
pFBshuttle(EcoRI) with Xbal and Muni. The resulting plasmid was termed 
pFBshuttleAd41 sRGD. This plasmid was linearized by EcoRI digestion and 
ligated with the 24,21 3bp EcoRI fragment of pSQ1 to generate P SQ1Ad41sRGD 
10 (Figure 21 B). 

EXAMPLE 13 

In Vivo Evaluation Of Ad5 Vectors Containing The Ad41 Short Fiber And 
Derivatives Thereof 

This example evaluates the in vivo biodistribution of adenoviral vectors 
15 containing 41 sF fibers and derivatives thereof to determine whether vectors 
containing the these fibers ablate liver transduction due to modified shaft 
regions. A positive control cohort received Av3nBg (see, Gorziglia eta/. (1996) 
J. Virology 70:4173-4178) or Ad5.£Gal.AF/5F, and a negative control group 
received HBSS. Ad5./?Gal.AF/5F is a derivative of the fiberless vector 
20 Ad5.y9gal.AF (ATCC accession number VR2636) modified to express AD5 fiber 
(see, e.g., International PCT application No. WO01 83729). 

The Ad5.yffGal.AF vector was pseudotyped with the Ad41sF fiber protein 
and injected in vivo. Cohorts of five C57BL/6 mice received each vector via tail 
vein injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
25 approximately 72 hours after vector administration by carbon dioxide 

asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 
animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for £-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 
30 determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent /?-galactosidase activity assay, p- 
galactosidase immunohistochemistry, hexon real-time PCR and the 
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chemiluminescent yff-galactosidase activity assay was carried out as described in 
example 3. 

The results of the hexon DNA analysis showed a dramatic reduction in 
liver transduction by vectors containing the Ad41sF fiber (Figure 22) with an 
approximately a 5-fold reduction in liver adenoviral DNA content compared to 
either control vector. 

In the above examples, several novel adenoviral vectors were generated 
containing various fiber modifications designed to ablate the normal tropism of 
the vector. See Table 3. Vectors were generated in which the heparan sulfate 
binding domain in the fiber shaft was replaced by amino acid substitutions. This 
mutation, termed HSP, was also combined with the KOI mutation (fiber knob AB 
loop mutation that ablates CAR binding), and the PD1 mutation (penton mutation 
that eliminates RGD/integrin interaction). In addition, a vector containing all three 
mutations (HSP, KOI, PD1) was generated. All vectors containing the HSP 
mutation, either alone or combined with other capsid modifications, showed 
dramatically reduced transduction efficiencies on A549 and HeLa cells. 
Furthermore, the same vectors showed dramatically reduced transduction of the 
liver following systemic delivery to mice. As an alternative strategy to ablate the 
normal tropism of Ad5-based vectors, the Ad5 fiber was replaced by a fiber from 
a different adenovirus serotype which does not bind CAR and does not contain 
the heparan binding domain in the shaft. Thus, vectors were generated 
containing the Ad35 fiber and the Ad41 short fiber. Versions of these two 
vectors containing a cRGD targeting ligand in the HI loop of the fiber were also 
produced. Additionally, vectors containing chimeric fibers were generated. A 
vector containing the Ad35 fiber tail and shaft regions fused to the Ad5 fiber 
knob domain as well as a vector containing the Ad5 fiber tail and shaft fused to 
the Ad35 fiber knob domain were constructed. Vectors containing either the 
entire Ad35 or Ad41 short fiber showed a significant reduction in liver 
transduction following delivery to mice via the tail vein. The observation of 
reduced liver transduction using vectors containing either an HSP mutation, the 
Ad35 fiber, or the Ad41 short fiber indicates the feasibility of detargeting 
adenoviral vectors in vivo. In vitro data with the Ad35 fiber or the Ad41 short 
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fiber with cRGD (see Example 14) indicate that the virus is completely viable, 
that is, it is not damaged by the absence of an HSP binding site and is 
retargetable. Taken together these data suggest that these vectors provide a 
suitable platform' for retargeting strategies. 
5 TABLE 3 



Description Of Recombinant Adenoviral Vectors Used 
To Demonstrate That Shaft Modifications Influence Tropism In Vivo 

Vector 



Vector 


Description 


Av1 nBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing /?-galactosidase 


Ad 5 Fiber derivatives: 


Av1nBgFK01 


The same as AvInBg but containing the KOI AB loop 
mutation in the fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the penton PD1 mutation 
that deletes the integrin binding, RGD tripeptide 


AvInBgS* 


The same as AvInBg but containing the 4 amino acid 
substitution in the shaft referred to as S* that modifies the 
HSP binding motif 


Av1 nBgFKOI S* 


The same as AvInBg but containing the fiber K01 and S* 
mutations combined 


Av1nBgS*PD1 


The same as AvInBg but containing the fiber S* and penton 
PD1 mutations combined 


Av1nBgFK01S*PD1 


The same as AvInBg but containing the fiber K01, S* and 
penton PD1 mutations combined 


Ad35 fiber derivatives: 


Av1nBg35F 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA 


Av1nBg5T35H 


The same as AvInBg but containing the 5T35H chimeric fiber 


Av1nBg35T5H 


The same as AvInBg but containing the 35T5H chimeric fiber 


Av1nBg35FRGD 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA with a cRGD ligand in the HI loop of the Ad35 fiber 


Ad41sF fiber derivatives: 


Av1nBg41sF 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA 
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Vector 


Description 


Av1 nBg41 sFRGD 


The same as AvInBg but containing the full length AcJ41 short 
fiber cDNA with a cRGD iigand in the HI loop of the Ad41 
short fiber 



EXAMPLE 14 

In Vitro Evaluation Of Adenoviral Vectors Containing The Ad41sF With A cRGD 
Ligand In The HI Loop 

The transduction efficiencies of adenoviral vectors containing the Ad41sF 
fiber with the cRGD ligand in the HI loop were evaluated on A549 cells. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber or Av1 nBgFKOI RGD, an adenoviral vector containing 
the KOI mutation in combination with the cRGD ligand in the HI loop. The day 
prior to infection, cells were seeded into 24-well plates at a density of 
approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1 nBgFKOI RGD, and Av1 nBg41 sFRGD 
were used to transduce A549 cells at a particle to cell ratios of 0 up to 10,000. 
The cell monolayers were stained with X-gal 24 hours after infection and the 
percentage of cells expressing #-galactosidase was determined by microscopic 
observation and counting of cells. Transductions were done in triplicate and 
three random fields in each well were counted, for a total of nine fields per 
vector. The results (Figure 23) show that the Av1 nBg41 sFRGD vector 
transduced cells to an equivalent level as Av1 nBgFKOI RGD at all vector doses 
examined. Neither FK01 or Ad41sF can bind CAR. The Ad41sF does not 
normally interact with CAR and additionally does not contain the HSP binding 
motif within the shaft domain. These data show that targeting peptides inserted 
into the loop regions of the fiber knob of KOI and Ad41sF allows for 
transduction of target cells via the targeted receptor. Surprisingly, HSP, not 
CAR and integrins, is the major entry route in vivo and ablation of HSP binding 
permits targeting of adenoviral vectors. 
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EXAMPLE 15 

Effect of the shaft modification on the biodistribution of adenoviral vectors in 
vivo 

The influence of fiber and penton modifications on the in vivo 
5 biodistribution of adenoviral vectors containing fiber head, shaft and penton 
mutations was examined. Vectors containing the HSP mutation combined with 
K01 , or PD1 , or a combination of all three mutations were evaluated as well as 
vectors containing the KOI mutation alone and the PD1 mutation alone. The 
indicated adenoviral vectors were systemically administered to C57BL6 mice as 
10 described above. A positive control cohort received AvInBg and a negative 
control group received HBSS. Cohorts of five C57BL/6 mice received each 
vector via tail vein injection at a dose of 1 x 10 13 particles per kg. The animals 
were sacrificed approximately 72 hours after vector administration by carbon 
dioxide asphyxiation. Liver, heart, lung, spleen, and kidney were collected from 
15 each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to determine adenoviral hexon DNA content. A separate sample of liver 
from each mouse was frozen to preserve it for a chemiluminescent fi- 
galactosidase activity assay. Hexon real-time PCR and the chemiluminescent fi- 
galactosidase activity assay was carried out as described in Example 3. 
20 The results derived from the liver are described in Example 6 (Figure 1 4A 

and B) and also shown in Figure 26 with results presented as percent control of 
AvInBg. The effect of the S* shaft modification on the biodistribution of 
adenovirus to the other organs is shown in Figure 25. The average adenoviral 
DNA content was determined as adenoviral genomic copies per cell and 
25 expressed as a percentage of the AvInBg ( + ) control value. The average 

percent control value + standard deviation is shown (n = 5 per group) for each 
tissue examined (Figure 25). Systemic delivery of Ad5 based vectors with 

wild-type fiber results in a preferential accumulation of vector DNA in the liver 
with 64 copies per cell with significantly less DNA found in the other organs 
30 with 1 .32 copies per cell found in lung, 2.18 copies per cell in spleen, 0.47 
copies per cell found in heart, and 0.72 copies per cell in the kidney. All 
differences found with PD1, S*, KQ1PD1, KOIS*, S*PD1, and KQ1S*PD1 were 
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significantly different than the AvInBg ( + ) control using a unpaired, t-test 
analysis, P value ( 0.024. When expressed as a percent of the AvInBg control 
values, the influence of each mutation, individually or in combination, becomes 
apparent. The S* mutation dramatically reduced gene transfer to all four organs, 
5 whereas, the KOI mutation did not. Thus, the importance of the shaft for 

transduction in vivo extends to organs besides the liver. Finally, gene transfer to 
the lung, heart, and kidney was diminished with PD1 suggesting a role for 
integrin binding in vector entry in these organs. 

EXAMPLE 16 

10 Retargeting the S*, shaft modification and the 41 sF fiber in vivo 

Vectors containing the HSP mutation have been shown to effectively 
detarget adenoviral vectors in vivo (see examples 6 and 15). The objective of 
this study was to evaluate the ability to retarget vectors containing the S* 
modification or the Ad41sF to tumors in vivo. A cRGD peptide was genetically 

15 incorporated into the fiber HI loop and evaluated in vitro (Examples 8 and 14). 

These same vectors were then evaluated in vivo in tumor-bearing mice. Athymic 
nu/nu female mice were injected with 8 x 10 6 A549 cells on the right hind flank. 
When tumors reached approximately 100mm3 in size, they were randomized 
into treatment groups. Cohorts of 6 mice received each vector via tail vein 

20 injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Tumor, liver, heart, lung, spleen, and kidney were collected from 
each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to dete mine adenoviral hexon DNA content. Hexon real-time PCR was 

25 carried out as described in example 3. A separate sample of liver from each 

mouse was frozen to preserve it for a chemiluminescent yff-galactosidase activity 
assay. Hexon real-time PCR and the chemiluminescent jff-galactosidase activity 
assay was carried out as described in example 3. 

The adenoviral vector biodistribution to the liver and tumor for each 

30 treatment group is shown in Figure 27. Vectors containing the S*, KOIS*, and 
41 sF fibers effectively detargeted the liver and tumor resulting in a significant 
reduction in the amount of adenoviral DNA found in each tissue in comparison to 
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the AvInBg control. Vectors containing the cRGD targeting ligand restored 
tranduction of the tumors to levels comparable to that achieved with the 
untargeted vector. 

These data demonstrate successful liver detargeting accompanied with 
5 tumor retargeting. The extent of tumor retargeting is relates to the affinity and 
type of ligand that is used. These data demonstrate the successful 
development of a targeted, systemically deliverable adenoviral vector that will 
target tumors in vivo. 

EXAMPLE 17 

10 Scale-Up Method For The Propagation Of Detargeted Adenoviral Vectors 

The growth and propagation of doubly or triply ablated adenoviral vectors 
requires novel scale up technologies. These detargeted vectors require 
alternative cellular entry strategies to allow for the efficient growth and 
generation of high titer preparations. A strategy for vector growth that is 

15 generally applicable to all detargeted adenoviral vectors, that does not require 
the development of new cell lines, and that aslo can be used for generating 
targeted vectors is provided herein. 

Three recombinant adenoviral vectors were prepared that contain single 
mutations in the fiber or penton or both mutations combined into one vector. 

20 These vectors are designated Av3nBgFK01, Av1nBgPD1, and Av1 nBgFKOI PD1 , 
respectively. The construction of these vectors is described above and a general 
description of each vector can be found in Table 1 above. 

Scale-up of detargeted adenoviral vectors: A polycation, specifically 
hexadimethrine bromide was obtained from Sigma Chemical Co (St. Louis, MO), 

25 Catalog No. 52495, and was maintained in the medium at 4 /yg/ml during the 
course of transfections and infections. To illustrate the affects of 
hexadimethrine bromide on the yield of detargeted adenoviral vectors the 
following experiment was carried out. Seven plates of AE1-2a adenoviral 
producer cells (Gorziglia et al. (1996) ,7. Virology 70:4173-4178) were 

30 transduced with 10 particles per cells of each of the indicated vectors (See Table 
4). Each vector was incubated with medium (Richters with 2% HI-FBS) 
containing hexadimethrine bromide at 4 /yg/ml for 30 min at room temperature 
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prior to infection. The infection was carried out for 2 hrs. Complete medium 
containing hexadimethrine bromide at 4//g/ml was added to each plate. Final 
concentration of hexadimethrine bromide in all of these experiments was 
maintained at 4 /vg/ml. The titers were determined spectrophotometrically using 
5 the conversion of 10D at A260nm per 1 x 10 12 particles (Mittereder et al. 

(1996) J Virology 70:7498-7509). The total particle yield was then normalized 
for the number of plates used for transduction. 

The inclusion of hexadimethrine bromide in the medium during the course 
of infection allows for the efficient propagation of detargeted adenoviral vectors 

10 containing fiber and penton mutations either alone or in combination. The affect 
of hexadimethrine bromide on vector yields is shown in Table 4. A 35-fold 
improvement in the yield of Av3nBgFKO! was found when hexadimethrine 
bromide was included in the culture medium and resulted in increasing the vector 
yield from 1 .3 x 10 10 up to 4.6 x 10 11 vector particle per plate. Hexadimethrine 

15 bromide has a minimal effect on the yield of the AvInBgPDI adenoviral vector 
containing the penton, PD1 mutation with only a 1 .2 fold improvement. The 
greatest effect using hexadimethrine bromide was found on the propagation of 
the doubly ablated adenoviral vector, Av1 nBgFKOI PD1 with increases in vector 
yield from barely detectable levels up to 4.53 x 10 10 vector particles per plate. 

20 These data demonstrate that use of nonspecific entry mechanisms allows for the 
efficient scale-up of detargeted adenoviral vectors. 

TABLE 4 



Efficient Scale-Up Of Detargeted Adenoviral Vectors Using hexadimethrine 

bromide 



Vector 


Vector Yield (particles/plate) 


Fold 
Improvement 


{-) hexadimethrine 
bromide 


( + ) hexadimethrine 
bromide 


AvInBg 


3.89 x 10 11 


5.72 x 10 n 


1.47 


Av3nBg 


8.58 x 10 10 


2.38 x 10 11 


2.77 


Av3nBgFK01 


1.30 x 10 10 


4.60 x 10" 


35.4 


Av1nBgPD1 


1.95 x 10 11 


2.40 x 10 11 


1.23 


AvInBgFKOl PD1 


TLTC* 


4.53 x 10 10 


T 
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*TLTC: Too low to count, a faint virus band was collected and the particle 
concentration was too dilute for titer determination. 
* Significant improvement 

The use of alternative polycations including protamine sulfate and 
5 poly-lysine as well as bifunctional proteins such as the anti-penton:TNFor fusion 
protein was investigated. Figure 24 show results that demonstrate all the 
reagents tested had some effect on enhancing transduction of the Av3nBgFK01 
vector. All of these compounds, when maintained in the medium during 
infection, enhanced transduction of the Av3nBgFK01 detargeted adenoviral 
10 vector. 

Bifunctional reagents:. The use of bifunctional reagents for the 
propagation of detargeted adenoviral vectors was examined using the 
anti-penton:TNFa fusion protein. This particular reagent is a fusion protein 
between an antibody against Ad5 penton and the TNFa protein that is produced 

1 5 using stably transfected insect cells. This reagent will bind specifically to the 
adenoviral capsid via penton base and allow for binding to cell surface TNF 
receptors. The use of this reagent for the propagation of detargeted vectors is 
illustrated in Table 5 using Av3nBgFK01 (also shown in Figure 24). Monolayers 
of S8 cells were infected with 10 or 100 particles per cell of Av3nBgFK01 or a 

20 control vector in the presence or absence of 1 //g/ml of the anti-penton:TNFa 

fusion protein. The monolayers were visually inspected over time for vector 

spread as indicated by the extent of cytopathic effect (CPE). The percentage of 

CPE at each time point is shown. The use of this bifunctional reagent clearly 

enhances the spread of the Av3nBgFK01 vector throughout the monolayer. 

25 TABLE 5 

Efficient Scale-Up Of Detargeted Adenoviral 
Vectors Using Bifunctional Reagents: Anti-Penton:TNFa 





10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc- j 
anti-penton TNF 


100 ppc + 
anti-penton TNF 




Perce 


,ntage of CPE 


Ad5Luc1 




24 h 


0% 


0% 


0% 


0% 
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10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


1 00 ppc + 
anti-penton TNF 




Percentage of CPE 


48 h 


20-30% 


20-30% 


90-100% 


90-100% 


72 h 


60-70% 


80-90% 


100% 


100% 


120 h 


100% 


100% 


100% 


100% 


Av3nBgK01 24hrs \ 


24 h 


0% 


0% 


0% 


0% 


48 h 


0% 


10-20% 


0% 


90-100% 


72 h 


5% 


60-70% 


5% 


100% 


120 h 


40-50% 


100% 


100% 


100% 



Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A modified adenovirus capsid protein, 

the unmodified capsid protein binds to heparin sulfate 
proteoglycan <HSP); and 
5 the capsid protein comprises a mutation, whereby binding to 

heparin sulfate proteoglycan (HSP) is altered. 

2. The modified protein of claim 1 that is a fiber protein 

3. The capsid protein of claim 2, wherein the binding of the modified 
fiber protein is eliminated or reduced compared to the unmodified protein. 

10 4. The modified protein of claim 2, wherein the binding of the 

modified fiber protein is eliminated or reduced compared to the unmodified 
protein. 

5. The modified protein of claim 3 that comprises an insertion, 
deletion or replacement of amino acids. 
15 6. The modified protein of claim 2, wherein the mutation alters the 

motif that binds to HSP, whereby HSP interaction is altered. 

7. The modified protein of claim 6, motif is BBXB or BBBXXB, 
wherein the B is a basic amino acid and X is any amino acid. 

8. The modified protein of claim 7, wherein the motif comprises the 
20 consensus sequence KKTK. 

9. The modified protein of claim 2, wherein the fiber is a modified 
Ad5 or Ad2 fiber. 

1 0. A modified protein of claim 2 that is a chimeric fiber protein, 
comprising portions of fiber proteins from at least two different adenoviruses, 

25 wherein: 

a shaft or portion thereof is from a first adenovirus, whereby the resulting 
fiber does not bind to HSP or binds to HSP with reduced affinity compared to an 
unmodified fiber protein; 

a shaft or portion thereof from the first adenovirus does not bind 
30 to HSP or binds to HSP with reduced affinity compared to the second 
adenovirus; 

the second adenovirus binds to HSP; and 
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the portion comprises a sufficient portion to alter HSP binding of 
the resulting protein. 

1 1 . The modified protein of claim 10, wherein the binding to HSP of 
the modified fiber protein is eliminated or reduced compared to the unmodified 

5 protein. 

12. The modified protein of claim 10, wherein the remainder of the fiber 
protein is from the second adenovirus. 

13. The modified protein of any of claims 2, 3, 10 and 11, further 
comprising one or more further modifications that reduce or eliminate interaction 

10 of the resulting fiber with one or more cell surface proteins in addition to HSP. 

14. The modified protein of claim 13, further comprising a ligand, 
whereby the resulting fiber binds to a receptor for the ligand. 

15. The modified protein of claim 14, wherein the ligand is included in 
the knob region. 

15 16. The modified protein of claim 14, wherein the ligand is inserted or 

it replaces a portion of the fiber, whereby the resulting fiber binds to a receptor 
for the ligand. 

17. A modified protein of claim 1 1 , wherein affinity for HSP is reduced 
at least by an amount selected from among reduced 5-fold, 10-fold and 100- 

20 fold. 

18. The modified protein of claim 1 1, wherein the first adenovirus is 
selected from the group consisting of subgroup B, D or F, and the second is of 
subgroup C. 

19. The modified protein of claim 10, wherein the first adenovirus is 
25 selected from the group consisting of Ad3, Ad35, Ad7, Ad1 1 , Ad1 6, Ad21 , 

Ad34, Ad40, Ad41 and Ad46. 

20. The modified protein of claim 1 8, wherein the second adenovirus 
is Ad5 or Ad2. 

21. The modified protein of claim 19, wherein the second adenovirus 
30 is Ad5 or Ad2. 

22. A modified protein of claim 1 selected from the group consisting of 
a fiber protein comprising: 
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the sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 
58, 62, 66, 70 and 72; or 

a sequence of amino acids having 90% sequence identity with a 
sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
5 70 and 72; or 

a sequence of amino acids encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency along at least 70% of its length to 
a sequence of nucleotides that encodes a sequence of amino acids set forth in 
any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 70 and 72. 
10 23. A nucleic acid molecule encoding a modified protein of any of 

claims 1-12 and 14-22. 

24. A nucleic acid molecule encoding a modified protein of claim 13. 

25. The nucleic acid molecule of claim 23 that comprises a vector. 

26. The nucleic acid molecule of claim 24 that comprises vector. 

15 27. The nucleic acid molecule of claim 25 that is an adenovirus vector. 

28. The nucleic acid molecule of claim 26 that is an adenovirus vector. 

29. The vector of claim 27 that is an adenoviral vector from a 
subgroup B, C or D adenovirus. 

30. The vector. of claim 28 that is an adenoviral vector from a 
20 subgroup B, G or D adenovirus. 

31 . A cell, comprising a nucleic acid molecule of claim 23. 

32. A cell, comprising a nucleic acid molecule of claim 24. 

33. The cell of claim 31 that is a eukaryotic cell. 

34. The cell of claim 32 that is a eukaryotic cell. 

25 35. A cell, comprising a nucleic acid molecule of claim 27, wherein: 

the cell is a eukaryotic cell; and 

the cell in a packaging cell. 
36. A cell, comprising a nucleic acid molecule of claim 28, wherein: 

the cell is a eukaryotic cell; and 
30 the cell in a packaging cell. 
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37. An adenoviral particle, comprising a modified protein of any of 
claims 1-12 and 14-22, whereby binding of the viral particle to HSP is altered 
compared to a particle that expresses an unmodified fiber. 

38. An adenoviral particle, comprising a modified protein of claim 13, 

5 whereby binding of the viral particle to HSP is altered compared to a particle that 
expresses an unmodified fiber. 

39. An adenoviral particle of claim 37, wherein a native receptor for 
the fiber is coxsackie-adenovirus receptor (CAR). 

40. The adenoviral particle of claim 39, further comprising a mutation 
10 in the CAR-binding region of the capsid. 

41 . The adenoviral particle of claim 39, further comprising a mutation 
in the a v integrin-binding region of the capsid, whereby binding to the integrin is 
eliminated or reduced. 

42. The adenoviral particle of claim 40, further comprising a mutation 
15 in the a v integrin-binding region of the capsid, whereby binding to the integrin is 

eliminated or reduced 

43. The adenoviral particle of claim 39, wherein the CAR-binding 
region of the capsid modified is on a fiber knob. 

44. The adenoviral particle of claim 43, wherein the fiber knob 
20 modification is in the AB loop or CD loop. 

45. The adenoviral particle of claim 44, wherein the fiber knob 
modification is selected from the group consisting of K01 and K012. 

46. The adenoviral particle of claim 39, wherein the adenovirus is a 
subgroup C, D or F adenovirus. 

25 47. The adenoviral particle of claim 46, wherein the subgroup C virus 

is Ad2 or Ad5, the subgroup D virus is Ad46 and the subgroup F virus is Ad41 . 

48. The adenoviral vector of claim 27 that is an early generation 
adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 

30 49. The adenoviral vector of claim 28 that is an early generation 

adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 



WO 03/062400 



PCT/US03/02295 



-94- 

50. The adenoviral vector of claim 48, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

51. The adenoviral vector of claim 49, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

5 52. The adenoviral vector of claim 27 that comprises heterologous 

nucleic acid. 

53. The adenoviral vector of claim 28 that comprises heterologous 
nucleic acid. 

54. The adenoviral vector of claim 52, wherein the heterologous 
10 nucleic acid encodes a polypeptide. 

55. The adenoviral vector of claim 53, wherein the heterologous 
nucleic acid encodes a polypeptide. 

56. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a regulatory nucleic acid. 

-15 57. The adenoviral vector of claim 53, wherein the heterologous 

nucleic acid comprises or encodes a regulatory nucleic acid. 

58. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a promoter or RNA. 

59. The adenoviral vector of claim 53, wherein the heterologous 
20 nucleic acid comprises or encodes a promoter or RNA. 

60. The adenoviral vector of claim 59, wherein the promoter is a cell 
or tissue specific promoter. 

61. The adenoviral vector of claim 59, wherein the promoter is 
operably linked to a gene of an adenovirus essential for replication. 

25 62. The adenoviral vector of claim 60, wherein the tissue specific 

promoter is a tumor specific promoter. 

63. The adenoviral vector of claim 58, wherein the polypeptide is a 
therapeutic polypeptide. 

64. A method of expressing heterologous nucleic acid in a cell, 
30 comprising transducing the cell with an adenoviral vector of claim 57. 

65. The method of claim 64, wherein: 
the cell is a tumor cell; 
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the adenoviral vector is an oncolytic vector; and 
the cell is killed . 

66. The method of claim 64, wherein the cell is a mammalian cell. 

67. The method of claim 64, wherein the cell is a primate cell. 
5 68. The method of claim 67, wherein the cell is a human cell. 

69. A method of reducing transduction of liver cells by an adenoviral 
particle, comprising reducing or eliminating binding of the particle to heparin 
sulfate proteoglycans (HSPs) on the liver cells. 

70. A scale up method for the propagation of a detargeted adenoviral 
10 particle, comprising: 

infecting a cell capable of replicating, maturing and packaging an 
adenoviral vector with a detargeted adenoviral vector in the presence of a 
reagent that results in entry of the adenoviral particle into the cell; 

culturing the infected cell under conditions suitable for growth, spread 
15 and propagation of the adenoviral vector; and 

recovering the resulting adenoviral particles. 

71 . The method of claim 70, wherein the reagent is a polycation. 

72. The method of claim 71, wherein the polycation is selected from 
the group consisting of hexadimethrine bromide, polyethylenimine, protamine 

20 sulfate and poly-L-lysine. 

73. The method of claim 70, wherein the reagent is a bifunctional 
protein that binds to the adenoviral particle and to a receptor on the cell. 

• 74. The method of claim 73, wherein: 
the bifunctional protein is selected from the group consisting of an 
25 anti-fiber antibody ligand fusion, an anti-fiber-Fab-FGF conjugate, an 

anti-penton-antibody ligand fusion, an anti-hexon antibody ligand fusion and a 
polylysine-peptide fusion, wherein the ligand is a ligand that binds to the 
receptor. 

75. The method of any one of claims 70-74, wherein the detargeted 
30 adenoviral particle expresses a modified capsid, whereby binding to at least one 
host cell receptor is reduced or eliminated compared with a wild-type 
adenovirus. 
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76. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with one host cell receptor. 

77. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with two host cell receptors. 

5 78. The method of claim 75, wherein the adenoviral particle is 

modified to eliminate or reduce binding with three host cell receptors. 

79. The, method of claim 75, wherein the particle is modified with one 
or more mutations selected from the group consisting of mutations that reduce 
or eliminate interactions with one or more of or v integrins, coxsackie-adenovirus 

10 receptors (CAR), and heparin sulfate proteoglycans (HSP). 

80. The method of claim 79, wherein the mutation is selected from the 
group consisting of PD1, KOI, K012 and S*. 

81. The modified protein of claim 2, wherein the mutation is in the 
shaft of a fiber, 

15 82. A modified protein of claim 3, wherein affinity for HSP is reduced 

at least by an amount selected from among reduced 5-fold, 10-fold and 100- 
fold. 
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SEQUENCE LISTING 
<110> The Scripps Research Institute 
Novartis AG 
Kaleko, Michael 
Nemerow, Glen R. 
Smith, Theodore 
Stevenson, Susan C. 

<12 0> Fiber Shaft Modifications for Efficient Targeting 

<130> 22908-123 6PC 

<140> Not yet assigned 
<141> Herewith 

<150> 60/350,388 
<151> 2002-01-24 

<150> 60/391,967 
<151> 2002-06-26 

<160> 72 . 

<170> Patentln version 3.0 

<210> 1 

<211> 4 

<212> PRT 

<213> adenovirus serotype 5 

<400> 1 

Lys Lys Thr Lys 

1 

<210> 2 

<211> 1746 

<212> DNA 

<213> adenovirus serotype 5 



<400> 2 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 



caagaccgtc 

caactgtgcc 

ctggggtact 

tcaaaatggg 

ccactgtgag 

ccctcacagt 

gcaacacact 

ttgccaccca 

ccctcaccac 

ccactggtag 

gactaaagta 

ctggtccagg 

gttttgattc 

aaaacagacg 

taagactagg 

acaaaggcct 

gcactgccaa 

ttgaatttgg 

gcctagaatt 

acagcacagg 

caccagctcc 



tgaagatacc 

ttttcttact 

ctctttgcgc 

caacggcctc 

cccacctctc 

tacctcagaa 

caccatgcaa 

aggacccctc 

caccgatagc 

cttgggcatt 

cggggctcct 

tgtgactatt 

acaaggcaat 

ccttatactt 

acagggccct 

ttacttgttt 

ggggttgatg 

ttcacctaat 

tgattcaaac 

tgccattaca 

agaggctaac 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tgg.:;tgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



<210> 3 

<211> 1746 

<212> DNA 

<213> adenovirus serotype 5 



<400> 3 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



caagaccgtc 

caactgtgcc 

ctggggtact 

tcaaaatggg 

ccactgtgag 

ccctcacagt 

gcaacacact 

ttgccaccca 

ccctcaccac 

ccactggtag 

gactaaagta 

ctggtccagg 

gttttgattc 

aaaacagacg 

taagactagg 

acaaaggcct 

gcactgccaa 

ttgaatttgg 

gcctagaatt 

acagcacagg 

caccagctcc 

tcttaacaaa 

gtttggctcc 

atggagtgct 

atcttactga 

atccaaaatc 

gagacaaaac 

acacaactcc 

ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 

cggggctcct 

tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 

cctccctttg 

ctatccgaac 

tctctggacg 

aaaaaaacca 

gccctaactg 

tcacaggccc 

acagtgtcag 

agtaccctta 

gacttgaaag 

ttgcatgtaa 

aataatactt 

atgcaactta 

gatgttagtt 

ctttttataa 

acagcttcaa 

tttgacgcta 

gcaccaaaca 

aaggctatgg 

gtaggaaaca 

tgttcactaa 

caaatacttg 

acagttcaaa 

tccttcctgg 

tatacaaacg 

actgccaaaa 

acactaacca 

tctatgtcat 

tcctcttaca 



tgtatccata 

tatcccccaa 

ctctagttac 

aggccggcaa 

agtcaaacat 

tggctgccgc 

cgctaaccgt 

aaggaaagct 

ctatcactgc 

agcccattta 

cagacgacct 

ccttgcaaac 

atgtagcagg 

atccgtttga 

actcagccca 

acaattccaa 

cagccatagc 

caaatcccct 

ttcctaaact 

aaaataatga 

atggaggcgg 

ctacagtttc 

gtgctcatct 

acccagaata 

ctgttggatt 

gtaacattgt 

ttacactaaa 

tttcatggga 

ctttttcata 



tgacacggaa 

tgggtttcaa 

ctccaatggc 

ccttacctcc 

aaacctggaa 

cgcacctcta 

gcacgactcc 

agccctgcaa 

ctcaccccct 

tacacaaaat 

aaacactttg 

taaagttact 

aggactaagg 

tgctcaaaac 

caacttggat 

aaagcttgag 

cattaatgca 

caaaacaaaa 

aggaactggc 

taagctaact 

agatgctaaa 

agttttggct 

tattataaga 

ttggaacttt 

tatgcctaac 

cagtcaagtt 

cggtacacag 

ct ggtctggc 

cattgcccaa 



<210> 4 

<211> 1737 

<212> DNA 

<213> adenovirus serotype 5 



<400> 4 

atgcggcgcg 

gcggcgccag 

gtgcctccgc 

gcacccctat 

tccctgaact 

tacagcccgg 

ggcgacctga 

aataagttta 



cggcgatgta 
tggcggcggc 
ggtacctgcg 
tcgacaccac 
accagaacga 
gggaggcaag 
aaaccatcct 
aggcgcgggt 



tgaggaaggt 
gctgggttct 
gcctaccggg 
ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 



cctcctccct 
cccttcgatg 
gggagaaaca 
ctggtggaca 
tttctgacca 
atcaatcttg 
atgccaaatg 
cgcttgccta 



cctacgagag 
ctcccctgga 
gcatccgtta 
acaagtcaac 
cggtcattca 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 



tgtggtgagc 
cccgccgttt 
ctctgagttg 
ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



60 
120 
180 
240 
300 
360 
420 
480 
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ctgaaatacg agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc 540 

atagacctta tgaacaacgc gatcgtggag cactacttga aagtgggcag acagaacggg 600 

gttctggaaa gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac 660 

cccgtcactg gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc 72 0 

attttgctgc caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc 780 

atccgcaagc ggcaaccctt ccaggagggc tttaggatca cctacgatga tctggagggt 84 0 

ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa agatgacacc 90 0 

gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc ggaagagaac 960 

tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatag ccgcggctac 102 0 

ccctacgacg tgcccgacta cgcgggcacc agcgccacac gggctgagga gaagcgcgct 10 8 0 

gaggccgaag cagcggccga agctgccgcc cccgctgcgc aacccgaggt cgagaagcct 1140 

cagaagaaac cggtgatcaa acccctgaca gaggacagca agaaacgcag ttacaaccta 12 0 0 

ataagcaatg acagcacctt cacccagtac cgcagctggt accttgcata caactacggc 1260 

gaccctcaga ccggaatccg ctcatggacc ctgctttgca ctcctgacgt aacctgcggc 132 0 

tcggagcagg tctactggtc gttgccagac atgatgcaag accccgtgac cttccgctcc 13 80 

acgcgccaga tcagcaactt tccggtggtg ggcgccgagc tgttgcccgt gcactccaag 144 0 

agcttctaca acgaccaggc cgtctactcc caactcatcc ■ gccagtttac ctctctgacc 1500 

cacgtgttca atcgctttcc cgagaaccag attttggcgc gcccgccagc ccccaccatc 1560 

accaccgtca gtgaaaacgt tcctgctctc acagatcacg ggacgctacc gctgcgcaac 1620 

agcatcggag gagtccagcg agtgaccatt actgacgcca gacgccgcac ctgcccctac 1680 

gtttacaagg ccctgggcat agtctcgccg cgcgtcctat cgagccgcac tttttga 1737 

<210> 5 
. <211> 20 
<212> DNA 

<213> adenovirus serotype 5 
<400> 5 

gaacaggagg tgagcttaga 2 0 

<210> 6 

<211> 43 

: <212> DNA 

<213> adenovirus serotype 5 

<400> 6 

tccgcctcca tttagtgaac agttaggaga tggagctggt gtg 43 

<210> 7 
<211> 44 
<212> DNA 

<213> adenovirus serotype 5 
<400> 7 

tcactaaatg gaggcggaga tgctaaactc actttggtct taac 44 

<210> 8 
<211> 20 
<212> DNA 

<213> adenovirus serotype 5 
<400> 8 

gtggcaggtt gaatactagg 20 

<210> 9 
<211> 8 
<212> PRT 

<213> adenovirus serotype 5 
<400> 9 

His Ala lie Arg Gly Asp Thr Phe 
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<210> 10 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> tnofified sequence for penton protein 
<400> 10 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser 
15 10 15 

<210> 11 

<211> 57 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 11 

cgcggaagag aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaa 57 

<210> 12 

<211> 59 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 12 

tatcgttcat gtcctccacc ggctgcattg ccgcggctgc cgcgttggag ttctcttcc 59 

<210> 13 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 
<400> 13 

cgatagccgc ggctacccct acgacgtgcc cgactacgcg ggcaccagcg ccacacgggc 60 
tgaggagaag cgcgc 75 

<210> 14 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 
<400> 14 

tcagcgcgct tctcctcagc ccgtgtggcg ctggtgcccg cgtagtcggg cacgtcgtag 6 0 

gggtagccgc ggc ~ 73 

<210> 15 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 
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<400> 15 

ggctccggct ccgagaggtg ggctcacagt ggttacattt 40 

<210> 16 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 16 

ggagccggag cctcaaacat aaacctggaa at 

<210> 17 d>z 

<211> 27 

<212> DNA 

<213> Artificial Sqquqticb 
<220> 

<223> amplification primer 

<400> 17 

ctctagaaat ggacggaatt attacag 27 

<210> 18'' 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 18 

tcttggtcat ctgcaacaac atgaagatag tg 32 

<210> 19 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 19 

gttgttgcag atgaccaaga gagtccggct ca 32 

<210> 20 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 
<400> 20 

agcaattgaa aaataaacac gttgaaacat aacacaaacg attctttagt tgtcgtcttc 60 
tgtaatgtaa gaa '""* w 

<210> 21 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> amplification primer 

<400> 21 
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agcaattgaa aaataaacac gttg 24 

<210> 22 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 22 

gaacaggagg tgagcttaga 2Q 

<210> 23 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 23 

gttaggtgga gggtttattc cggtccacaa agttagctta tc 42 

<210> 24 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 24 

gataagctaa ctttgtggac cggaataaac cctccaccta ac 42 

<210> 25 

<211> 20 

<212> DNA 

<213> Ax-tificial Sequence 
<220> 

<223> amplification primer 

<400> 25 

gtggcaggtt gaatactagg 20 

<210> 26 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 26 

gttaggagat ggagctggtg tagtccataa ggtgttaata c 41 

<210> 27 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 27 

gtattaacac cttatggact acaccagctc catctcctaa c 



41 
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<210> 28 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 28 

tgcgcaaaaa caatcaccac gacaatcaca atgtacattg gaagaaatca tacg 54 

<210> 29 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> amplification primer 

<400> 29 

acattgtgat tgtcgtggtg attgtttttg cgcatatgcc atacaatttg aatg 54 

<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> RGD targeting peptide 
<400> 30 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys 
15 10 



<210> 


31 


<211> 


32 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


amplification primer 


<400> 


31 



ttcttttcat ctgcaacaac atgaagatag tg 

<210> 
<211> 
<212> 
<213> 
<220> 

<223> amplification primer 
<400> 32 

gttgttgcag atgaaaagaa ccagaattga ag 



32 



32 



32 
32 
DNA 

Artificial Sequence 



<210> 33 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 
<400> 33 

tgcaattgaa aaataaacac gttgaaacat aacacaaacg attctttatt cttcagttat 60 
gtagcaaaat aca 73 
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<210> 34 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 34 

agtacaaaaa caatcaccac gacaatcaca gtttatctcg ttgtagacga cactga 56 

<210> 35 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> amplification primer 

<400> 35 

tgtgattgtc gtggtgattg tttttgtact agtgggtatg cttttacttt t 51 

<210> 36 

<211> 4 

<212> PRT 

<213> Adenovirus type 5 

<400> 36 

Thr Leu Trp Thr 

1 

<210> 37 

<211> 7 

<212> PRT 

<213> SV40 

<400> 37 

Pro Lys Lys Lys Arg Lys Val 
1 5 

<210> 38 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 38 

cttcgatgat gccgcagtg 19 

<210> 39 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 39 

gggctcaggt actccgagg 19 

<210> 40 
<211> 25 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<2 23> amplification primer 
<400> 40 

ttacatgcac atctcgggcc aggac 25 

<210> 41 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid GRE5 -E1-SV4 0-Hygro 
<400> 41 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 12 0 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720 
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 7 80 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 84 0 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 102 0 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 114 0 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200- 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 12 6 0 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 13 80 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 144 0 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 162 0 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 16 8 0 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 192 0 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2 04 0 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 210 0 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 222 0 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 252 0 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
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gcattttgac 
cttgtgaggc 
ctgagtgtat 
cgcagaagta 
aggaggctat 
gcaaacttgt 
tagatacgga 
ttggcatgga 
cggttttcct 
atacctgtgt 
ggaagggggt 
ggtgtacctt 
actgtggttg 
gcaactgcga 
tgaagaccat 
tactgacccg 
gcaatttgag 
acggggtgtt 
ccaggtgcag 
atgtgaccga 
gctctagcga 
gaaagaatat 
ccgccatgag 
tgcccccatg 
tcctgcccgc 
ctgcagcctc 
ttgctttcct 
agttgacggc 
agcagctgtt 
cggtttaaaa 
gctgtctcag 
ttaaagcaga 
aatttcacaa 
aatgtatctt 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
actfcggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 



aattacagag 
tacagaggag 
tacttttcaa 
ttccatagag 
tagggtatat 
aaatatcagg 
ggatagggtg 
cggggtggtt 
ggccaatacc 
ggaagcctgg 
ggtgtgtcgc 
gggtatcctg 
cttcatgcta 
ggacagggcc 
tcacgtagcc 
ctgttccttg 
tcacactaag 
tgacatgacc 
accctgcgag 
ggagctgagg 
tgaagataca 
ataaggtggg 
caccaactcg 
ggccggggtg 
aaactctact 
cgccgccgct 
gagcccgctt 
tcttttggca 
ggatctgcgc 
cataaataaa 
ctgactgctt 
acttgtttat 
ataaagcatt 
atcatgtctg 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 
ccgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 



gatgggcagg 
gctaggaatc 
cagatcaagg 
cagctgacca 
gcaaaggtgg 
aattgttgct 
gcctttagat 
attatgaatg 
aaccttatcc 
accgatgtaa 
cccaaaagca 
tctgagggta 
gtgaaaagcg 
tctcagatgc 
agccactctc 
catttgggta 
atattgcttg 
atgaagatct 
tgtggcggta 
cccgatcact 
gattgaggta 
ggtcttatgt 
tttgatggaa 
cgtcagaatg 
accttgacct 
tcagccgctg 
gcaagcagtg 
caattggatt 
cagcaggttt 
aaaccagact 
aagtcgcaag 
tgcagcttat 
tttttcactg 
gtcgactcta 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtag : 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtak 
cccggcgtca 
tggaaaacgt 



ggctaaaggg 
tagcttttag 
ataattgcgc 
cttactggct 
cacttaggcc 
acatttctgg 
gtagcatgat 
taaggtttac 
tacacggtgt 
gggttcgggg 
gggcttcaat 
actccagggt 
tggctgtgat 
tgacctgctc 
gcaaggcctg 
acaggagggg 
agcccgagag 
ggaaggtgct 
aacatattag 
tggtgctggc 
ctgaaatgtg 
agttttgtat 
gcattgtgag 
tgatgggctc 
acgagaccgt 
cagccaccgc 
cagcttcccg 
ctttgacccg 
ctgccctgaa 
ctgtttggat 
ccgaattgga 
aatggttaca 
cattctagtt 
gactcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 

ggtggttttt 

cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 



ggtaaagagg 
cttaatgacc 
taatgagctt 
gcagccaggg 
agattgcaag 
gaacggggcc 
aaatatgtgg 
tggccccaat 
aagcttctat 
ctgtgccttt 
taagaaatgc 
gcgccacaat 
taagcataac 
ggacggcaac 
gccagtgttt 
ggtgttccta 
catgtccaag 
gaggtacgat 
gaaccagcct 
ctgcacccgc 
tgggcgtggc 
ctgttttgca 
ctcatatttg 
cagcattgat 
gtctggaacg 
ccgcgggatt 
ttcatccgcc 
ggaacttaat 
ggcttcctcc 
ttggatcaag 
tccaattcgg 
aataaagcaa 

gtggtttgtc 

cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 



gagcgggggg 
agacaccgtc 
gatctgctgg 
gatgattttg 
tacaagatca 
gaggtggaga 
ccgggggtgc 
tttagcggta 
gggtttaaca 
tactgctgct 
ctctttgaaa 
gtggcctccg 
atggtatgtg 
tgtcacctgc 
gagcataaca 
ccttaccaat 
gtgaacctga 
gagacccgca 
gtgatgctgg 
gctgagtttg 
ttaagggtgg 
gcagccgccg 
acaacgcgca 
ggtcgccccg 
ccgttggaga 
gtgactgact 
cgcgatgaca 
gtcgtttctc 
cctcccaatg 
caagtgtctt 
atcgatctta 
tagcatcaca 
caaactcatc 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
tagctcacgc 
gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggctccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
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ccgctgttga gatccagttc gatgtaaccc 
tttactttca ccagcgtttc tgggtgagca 
ggaataaggg cgacacggaa atgttgaata 
agcatttatc agggttattg tctcatgagc 
aaacaaatag gggttccgcg cacatttccc 
attattatca tgacattaac ctataaaaat 
gcgtttcggt gatgacggtg aaaacctctg 
ttgtctgtaa gcggatgccg ggagcagaca 
cgggtgtcgg ggctggctta actatgcggc 
tatgcggtgt gaaataccgc acagatgcgt 
agcgttaata ttttgttaaa attcgcgtta 
caataggccg aaatcggcaa aatcccttat 
agtgttgttc cagtttggaa caagagtcca 
gggcgaaaaa ccgtctatca gggcgatggc 
tttttggggt cgaggtgccg taaagcacta 
agagcttgac ggggaaagcc ggcgaacgtg 
gcgggcgcta gggcgctggc aagtgtagcg 
gcgcttaatg cgccgctaca gggcgcgtcc 
gaagggcgat cggtgcgggc ctcttcgcta 
gcaaggcgat taagttgggt aacgccaggg 
gccagtgaat tgtaatacga ctcactatag 

<210> 42 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid MMTV-E2a-SV40-Neo 
<400> 42 

gaattccgca ttgcagagat attgtattta 
* tgaccattca ccacattggt gtgcacctcc 
gagtgtccta cacctagggg agaagcagcc 
tctgcgcaca aacggatgag cccatcagac 
ggcatagctc tgctttgcct ggggctattg 
ctcacccttg actcttttaa tagctcttct 
aactcgacct tcctcctgag gcaaggacca 
ttttgtcctt cagaaataga aataagaatg 
accaatccaa taggtagatt attagttact 
actattttta ctcaaattca gaagttagaa 
acctcaattg aagaacaggt gcaaggacta 
aaaaagagtg tttttgtcaa aataggagac 
ccttacatct acagaccaac agatgccccc 
tgggataggt gggttacagt caatggctat 
gaaagactcg ccagagctag acctccttgg 
atgaaacaac aggtacatga ttatatttat 
attttccata ccaaggaggg gacagtggct 
catggcatga gttattatga atagccttta 
agtaagtttt tggttacaaa ctgttcttaa 
cttggtttgg tatcaaaggt tctgatctga 
ttggaattta tccaaatctt atgtaaatgc 
attttttgag taaacttgca acagtcctaa 
gccatcccgt ctccgctcgt cacttatcct 
cggcgaccct caggtcggcc gactgcggca 
gtgacccttg tctctatttc tactatttgg 
ggctatcatc acaagagcgg aacggactca 
cgtccaagac cctcaaagat ttttggcact 
gcgccctgcc gcaaggccag ctgcttgtcc 
ggtatcttgc agttttggaa aaagatgtga 
ac 99ggtaga agttgaggcg cgggttgggc 
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actcgtgcac ccaactgatc ttcagcatct 642 0 
aaaacaggaa ggcaaaatgc cgcaaaaaag 6480 
ctcatactct tcctttttca atattattga 6540 
ggatacatat ttgaatgtat ttagaaaaat 6600 
cgaaaagtgc cacctgacgt ctaagaaacc 6660 
aggcgtatca cgaggcccct ttcgtctcgc 672 0 
acacatgcag ctcccggaga cggtcacagc 67 80 
agcccgtcag ggcgcgtcag cgggtgttgg 684 0 
atcagagcag attgtactga gagtgcacca 6 90 0 
aaggagaaaa taccgcatca ggaaattgta 6960 
aatttttgtt aaatcagctc attttttaac 7020 
aaatcaaaag aatagaccga gatagggttg 7080 
ctattaaaga acgtggactc caacgtcaaa 714 0 
ccactacgtg aaccatcacc ctaatcaagt 72 0 0 
aatcggaacc ctaaagggag cccccgattt 7260 
gcgagaaagg aagggaagaa agcgaaagga 732 0 
gtcacgctgc gcgtaaccac cacacccgcc 738 0 
cattcgccat tcaggctgcg caactgttgg 7440 
ttacgccagc tggcgaaagg gggatgtgct 750 0 
ttttcccagt cacgacgttg taaaacgacg 7560 
ggcgaattaa ttcgggg 7607 



agtgcctagc tcgatacaat aaacgccatt 60 
aagcttgggc agaaatggtt gaactcccga 120 
aaggggttgt ttcccaccaa ggacgacccg 180 
aaagacatat tcattctctg ctgcaaactt 24 0 
ggggaagttg cggttcgtgc tcgcagggct 3 00 
gtgcaagatt acaatctaaa caattcggag 360 
cagccaactt cctcttacaa gccgcatcga 42 0 
cttgctaaaa attatatttt taccaataag 480 
atgttaagaa atgaatcatt atcttttagt 54 0 
atgggaatag aaaatagaaa gagacgctca 60 0 
ttgaccacag gcctagaagt aaaaaaggga 66 0 
sggtggtggc aaccagggac ttatagggga 72 0 
ttaccatata caggaagata tgacttaaat 780 
aaagtgttat atagatccct cccttttcgt 84 0 
tgtatgttgt ctcaagaaga aaaagacgac 9v0 
ctaggaacag gaatgcactt ttggggaaag 960 
ggactaatag aacattattc tgcaaaaact 102 0 
ttggcccaac cttgcggttc ccagggctta 1080 
aacgaggatg tgagacaagt ggtttcctga 114 0 
gctctgagtg ttctattttc ctatgttctt 1200 
ttatgtaaac caagatataa aagagtgctg 1260 
cattcacctc ttgtgtgttt gtgtctgttc 1320 
tcactttcca gagggtcccc ccgcagaccc 13 80 
gctggcgccc gaacagggac cctcggataa 144 0 
tgtttgtctt gtattgtctc tttcttgtct 1500 
ccatagggac caagctagcg cttctcgtcg 1560 
tcgttgagcg aggcgatatc aggtatgaca 1620 
gctcggctgc ggttggcacg gcaggatagg 168 0 
taggtggcaa gcacctctgg cacggcaaat 1740 
tcgcatgtgc cgttttcttg gcgtttgggg 1800 
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ggtacgcgcg gtgagaatag gtggcgttcg taggcaaggc tgacatccgc tatggcgagg I860 
ggcacatcgc tgcgctcttg caacgcgtcg cagataatgg cgcactggcg ctgcagatgc 1920 
ttcaacagca cgtcgtctcc cacatctagg tagtcgccat gcctttcgtc cccccgcccg 1980 
acttgttcct cgtttgcctc tgcgttgtcc tggtcttgct ttttatcctc tgttggtact 2040 
gagcggtcct cgtcgtcttc gcttacaaaa cctgggtcct gctcgataat cacttcctcc 2100 
tcctcaagcg ggggtgcctc gacggggaag gtggtaggcg cgttggcggc atcggtggag 2160 
gcggtggtgg cgaactcaga gggggcggtt aggctgtcct tcttctcgac tgactccatg 2220 
atctttttct gcctatagga gaaggaaatg gccagtcggg aagaggagca gcgcgaaacc 22 80 
acccccgagc gcggacgcgg tgcggcgcga cgtcccccaa ccatggagga cgtgtcgtcc 2340 
ccgtccccgt cgccgccgcc tccccgggcg cccccaaaaa agcggatgag gcggcgtatc 24 0 0 
gagtccgagg acgaggaaga ctcatcacaa gacgcgctgg tgccgcgcac acccagcccg 2460 
cggccatcga cctcggcggc ggatttggcc attgcgccca agaagaaaaa gaagcgccct 252 0 
tctcccaagc ccgagcgccc gccat caeca gaggtaatcg tggacagega ggaagaaaga 25 8 0 
gaagatgtgg cgctacaaat ggtgggtttc agcaacccac cggtgctaat caagcatggc 264 0 
aaaggaggta agegcacagt gcggcggctg aatgaagacg acccagtggc gcgtggtatg 2700 
eggaegcaag aggaagagga agagcccagc gaagcggaaa gtgaaattac ggtgatgaac 2760 
ccgctgagtg tgccgatcgt gtctgcgtgg gagaagggca tggaggctgc gcgcgcgctg 2820 
atggacaagt accacgtgga taacgatcta aaggegaact tcaaactact gcctgaccaa 2880 
gtggaagctc tggcggccgt atgeaagace tggctgaacg aggagcaccg cgggttgcag 294 0 
ctgaccttca ccagcaacaa gacctttgtg acgatgatgg ggegattect geaggegtae 3000 
ctgcagtcgt ttgcagaggt gacctacaag catcacgagc ccacgggctg cgcgttgtgg 3060 
ctgcaccgct gegctgagat egaaggegag cttaagtgtc tacaeggaag cattatgata 3120 
aataaggagc acgtgattga aatggatgtg aegagegaaa aegggcageg cgcgctgaag 3180 
gagcagtcta geaaggecaa gatcgtgaag aaccggtggg gecgaaatgt ggtgcagatc 3240 
tccaacaccg aegcaaggtg ctgcgtgcac gacgcggcct gtccggccaa tcagttttcc 3300 
ggcaagtctt gcggcatgtt cttctctgaa ggegcaaagg ctcaggtggc ttttaagcag 3360 
atcaaggctt ttatgeagge gctgtatcct aacgcccaga ccgggcacgg tcaccttttg 3420 
atgccactac ggtgcgagtg caactcaaag cctgggcacg cgcccttttt gggaaggcag 34 80 
ctaccaaagt tgactccgtt cgccctgagc aacgeggagg acctggacgc ggatctgatc 3540 
tccgacaaga gcgtgctggc cagcgtgcac cacccggcgc tgatagtgtt ccagtgctgc 360 0 
aaccctgtgt atcgcaactc gcgcgcgcag ggeggaggee ccaactgcga cttcaagata 3 66 0 
tcggcgcccg acctgetaaa cgcgttggtg atggtgcgca gcctgtggag tgaaaacttc 372 0 
accgagctgc cgcggatggt tgtgcctgag tttaagtgga gcactaaaca ccagtatcgc 3 7 80 
aacgtgtccc tgccagtggc geatagegat gegeggcaga acccctttga tttttaaacg 384 0 
gegcagaegg caagggtggg ggtaaataat cacccgagag tgtacaaata aaagcatttg 3900 
cctttattga aagtgtctct agtacattat ttttacatgt ttttcaagtg acaaaaagaa 3960 
gtggcgctcc taatctgege actgtggctg eggaagtagg gcgagtggcg ctccaggaag 402 0 
ctgtagagct gttcctggtt gegaegcagg gtgggctgta cctggggact gttgagcatg 4080 
gagttgggta ccccggtaat aaggttcatg gtggggttgt gatccatggg agtttggggc 4140 
cagttggcaa aggcgtggag aaacatgcag cagaatagtc cacaggcggc cgagttgggc 42 0 0 
ccctgtacgc tttgggtgga cttttccagc gttatacagc ggtcggggga agaagcaatg 4260 
gcgctacggc gcaggagtga ctcgtactca aactggtaaa ectgettgag tegctggtea 432 0 
gaaaagccaa agggctcaaa gaggtagcat gtttttgagt gcgggttcca ggcaaaggee 43 8 0 
atccagtgta cgcccccagt ctcgcgaccg geegtattga etatggegea ggcgagcttg 444 0 
tgtggagaaa caaagectgg aaagcgcttg tcataggtgc ccaaaaaata tggcccacaa 4500 
ccaagatctt tgacaatggc tttcagttcc t gctcactgg agcccatggc ggcagctgtt 4560 
gttgatgttg ettgettett tatgttgtgg cgttgccggc cgagaagggc gtgcgcaggt 462 0 
acaeggttte gatgacgecg cggtgcggcc ggtgcacacg gaccacgtca aagacttcaa 4680 
acaaaacata aagaagggtg ggctcgtcca tgggatccat atatagggee cgggttataa 474 0 
ttacctcagg tcgacctcga gggatctttg tgaaggaacc ttacttctgt ggtgtgacat 4800 
aattggacaa actacctaca gagatttaaa gctctaaggt aaatataaaa tttttaagtg 4 86 0 
tataatgtgt taaactactg attctaattg tttgtgtatt ttagattcca acctatggaa 4 92 0 
ctgatgaatg ggagcagtgg tggaatgect ttaatgagga aaacctgttt tgctcagaag 4980 
aaatgecate tagtgatgat gaggctactg ctgactctca acattctact cctccaaaaa 5040 
agaagagaaa ggtagaagac cccaaggact ttccttcaga attgetaagt tttttgagtc 5100 
atgctgtgtt tagtaataga actcttgett getttgetat ttacaccaca aaggaaaaag 5160 
ctgcactgct atacaagaaa attatggaaa aatattctgt aacctttata agtaggcata 5220 
acagttataa tcataacata ctgttttttc ttactccaca caggcataga gtgtctgcta 5280 
ttaataacta tgctcaaaaa ttgtgtacct ttagcttttt aatttgtaaa ggggttaata 5340 
aggaatattt gatgtatagt gecttgacta gagatcataa tcagccatac cacatttgta 5400 
gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa acataaaatg 5460 
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aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa ataaagcaat 552 0 
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 5580 
aaactcatca atgtatctta tcatgtctgg atccggctgt ggaatgtgtg tcagttaggg 564 0 
tgtggaaagt ccccaggctc cccagcaggc agaagtatgc aaagcatgca tctcaattag 570 0 
tcagcaacca ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg 5760 
catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc gcccctaact 582 0 
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat ttatgcagag 5880 
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt ttttggaggc 594 0 
ctaggctttt gcaaaaagct tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 6000 
ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 60 60 
gctactgggc tatctggaca agggaaaacg caagcgcaaa* gagaaagcag gtagcttgca 6120 
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 6180 
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 624 0 
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 6300 
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 63 6 0 
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 6420 
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 648 0 
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 654 0 
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 6600 
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 6660 
ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 672 0 
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 67 80 
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 6 84 0 
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 690 0 
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 6960 
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 7020 
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 7080 
atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc 7140 
gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 72 00 
cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct 7260 
ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggttcagct gctgcctgag 7320 
gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcatcc aggaaaccag 73 80 
cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 7440 
tttggtcccg gatctttgtg aaggaacctt acttctgtgg tgtgacataa ttggacaaac 7500 
tacctacaga gatttaaagc tctaaggtaa atataaaatt tttaagtgta taatgtgtta 7560 
aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 7620 
agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 7680 
gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 7740 
tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7800 
gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7 860 
acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 792 0 
ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 
ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 
tgtatagtgc cttgactaga gatcataatc agccatacca catttgtaga ggttttactt 810 0 
gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 816 0 
gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 822 0 
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 8280 
gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa ccctaacctc 8340 
ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 8400 
aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 8460 
ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 852 0 
cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 858 0 
accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8640 
attttcccca cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg 8700 
aacccagcac tccactggat aagcattatc cttatccaaa acagccttgt ggtcagtgtt 8760 
catctgctga ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatttggt 882 0 
cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8 880 
aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga 8940 
atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 9000 
catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa ttcttgaaga 9060 
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 9120 
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tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 9240 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 93 00 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 9360 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 942 0 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 954 0 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 9600 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 9720 
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 990 0 
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10020 
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 10080 
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 1014 0 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc 10320 
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 10380 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 10440 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 10500 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10560 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 1062 0 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10740 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 10920 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt 11100 
atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 11220 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 112 80 
tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 1152 0 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580 
catggattgg acgaaccact ~ 11600 

<210> 43 
<211> 35211 

<212> DNA » 
<213> Artificial Sequence 

<220> 

<223> Plasmid AvlnBg 
<400> 43 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 12 0 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 24 0 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 3 60 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 42 0 
cgggtcaaag ttggcgtttt attattatag tcagtacgta ccagtgcact ggcctaggaa 480 
gcttggtacc ggtgaattcg ctagcgttcg cgccccgatg tacgggccag atatacgcgt 54 0 
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atctgagggg actagggtgt gtttaggcga 
agtcccctca ggatatagta gtttcgcttt 
caatactctt gtagtcttgc aacatggtaa 
agaaaaagca ccgtgcatgc cgattggtgg 
gaaggcaaca gacgggtctg acatggattg 
tattgtattt aagtgcctag ctcgatacaa 
tgtgcacctc cggccctggc cactctcttc 
ggctcgcggt tgaggacaaa ctcttcgcgg 
tcggcctccg aacggtactc cgccgccgag 
ggaaaacctc tcgagaaagg cgtgtaacca 
ccgggctgca ggaattcgat ctagatggat 
gaccccaagg actttccttc agaattgcta 
ttacaacgtc gtgactggga aaaccctggc 
ccccctttcg ccagctggcg taatagcgaa 
ttgcgcagcc tgaatggcga atggcgcttt 
gaaagctggc tggagtgcga tcttcctgag 
cagatgcacg gttacgatgc gcccatctac 
ccgccgtttg ttcccacgga gaatccgacg 
gaaagctggc tacaggaagg ccagacgcga 
catctgtggt gcaacgggcg ctgggtcggt 
tttgacctga gcgcattttt acgcgccgga 
tggagtgacg gcagttatct ggaagatcag 
gacgtctcgt tgctgcataa accgactaca 
tttaatgatg atttcagccg cgctgtactg 
cgtgactacc tacgggtaac agtttcttta 
accgcgcctt tcggcggtga aattatcgat 
ctacgtctga acgtcgaaaa cccgaaactg 
gcggtggttg aactgcacac cgccgacggc 
ggtttccgcg aggtgcggat tgaaaatggt 
attcgaggcg ttaaccgtca cgagcatcat 
acgatggtgc aggatatcct gctgatgaag 
-cattatccga accatccgct gtggtacacg 
gatgaagcca atattgaaac ccacggcatg 
cgctggctac cggcgatgag cgaacgcgta 
ccgagtgtga tcatctggtc gctggggaat 
ctgtatcgct ggatcaaatc tgtcgatcct 
gccgacacca cggccaccga tattatttgc 
cccttcccgg ctgtgccgaa atggtccatc 
cgcccgctga tcctttgcga atacgcccac 
aaatactggc aggcgtttcg tcagtatccc 
gtggatcagt cgctgattaa atatgatgaa 
gattttggcg atacgccgaa cgatcgccag 
cgcacgccgc atccagcgct gacggaagca 
ttatccgggc aaaccatcga agtgaccagc 
ctcctgcact ggatggtggc gctggatggt 
gatgtcgctc cacaaggtaa acagttgatt 
gccgggcaac tctggctcac agtacgcgta 
gccgggcaca tcagcgcctg gcagcagtgg 
cccgccgcgt cccacgccat cccgcatctg 
ctgggtaata agcgttggca atttaaccgc 
ggcgataaaa aacaactgct gacgccgctg 
aacgacattg gcgtaagtga agcgacccgc 
aaggcggcgg gccattacca ggccgaagca 
gctgatgcgg tgctgattac gaccgctcac 
atcagccgga aaacctaccg gattgatggt 
gaagtggcga gcgatacacc gcatccggcg 
gtagcagagc gggtaaactg gctcggatta 
actgccgcct gttttgaccg ctgggatctg 
ttcccgagcg aaaacggtct gcgctgcggg 
tggcgcggcg acttccagtt caacatcagc 
agccatcgcc atctgctgca cgcggaagaa 



aaagcggggc ttcggttgta cgcggttagg 6 00 
tgcataggga gggggaaatg tagtcttatg 660 
cgatgagtta gcaacatgcc ttacaaggag 72 0 
aagtaaggtg gtacgatcgt gccttattag 78 0 
gacgaaccac tgaattccgc attgcagaga 840 
taaacgccat ttgaccattc accacattgg 900 
cgcatcgctg tctgcggggg ccagctgttg 960 
tctttccagt actcttggat cggaaacccg 1020 
ggacctgagc gagtccgcat cgaccggatc 10 80 
gtcacagtcg ctctagaact agtggatccc 114 0 
aaaggtccaa aaaagaagag aaaggtagaa 1200 
agttttttga gtgattcact ggccgtcgtt 12 60 
gttacccaac ttaatcgcct tgcagcacat 1320 
gaggcccgca ccgatcgccc ttcccaacag 13 80 
gcctggtttc cggcaccaga agcggtgccg 1440 
gccgatactg tcgtcgtccc ctcaaactgg 1500 
accaacgtaa cctatcccat tacggtcaat 1560 
ggttgttact cgctcacatt taatgttgat 1620 
attatttttg atggcgttaa ctcggcgttt 1680 
tacggccagg acagtcgttt gccgtctgaa 1740 
gaaaaccgcc tcgcggtgat ggtgctgcgt 1800 
gatatgtggc ggatgagcgg cattttccgt 1860 
caaatcagcg atttccatgt tgccactcgc 192 0 
gaggctgaag ttcagatgtg cggcgagttg 1980 
tggcagggtg aaacgcaggt cgccagcggc 2 040 
gagcgtggtg gttatgccga tcgcgtcaca 2100 
tggagcgccg aaatcccgaa tctctatcgt 2160 
acgctgattg aagcagaagc ctgcgatgtc 2220 
ctgctgctgc tgaacggcaa gccgttgctg 22 80 
cctctgcatg gtcaggtcat ggatgagcag 2340 
cagaacaact ttaacgccgt gcgctgttcg 2400 
ctgtgcgacc gctacggcct gtatgtggtg 2460 
gtgccaatga atcgtctgac cgatgatccg 2520 
acgcgaatgg tgcagcgcga tcgtaatcac 2580 
gaatcaggcc acggcgctaa tcacgacgcg 2 640 
tcccgcccgg tgcagtatga aggcggcgga 2700 
ccgatgtacg cgcgcgtgga tgaagaccag 2760 
aaaaaatggc tttcgctacc tggagagacg 2 82 0 
gcgatgggta acagtcttgg cggtttcgct 2880 
cgtttacagg gcggcttcgt ctgggactgg 2940 
aacggcaacc cgtggtcggc ttacggcggt 3 000 
ttctgtatga acggtctggt ctttgccgac 3 060 
aaacaccagc agcagttttt ccagttccgt 312 0 
gaatacctgt tccgtcatag cgataacgag 3180 
aagccgctgg caagcggtga agtgcctctg 3240 
gaactgcctg aactaccgca gccggagagc 3300 
gtgcaaccga acgcgaccgc atggtcagaa 3360 
cgtctggcgg aaaacctcag tgtgacgctc 3420 
accaccagcg aaatggattt ttgcatcgag 3480 
cagtcaggct ttctttcaca gatgtggatt 354 0 
cgcgatcagt tcacccgtgc accgctggat 360 0 
attgacccta acgcctgggt cgaacgctgg 3660 
gcgttgttgc agtgcacggc agatacactt 3720 
gcgtggcagc atcaggggaa aaccttattt 3 78 0 
agtggtcaaa tggcgattac cgttgatgtt 3 84 0 
cggattggcc tgaactgcca gctggcgcag 3900 
gggccgcaag aaaactatcc cgaccgcctt 3960 
ccattgtcag acatgtatac cccgtacgtc 4 02 0 
acgcgcgaat tgaattatgg cccacaccag 40 80 
cgctacagtc aacagcaact gatggaaacc 4140 
ggcacatggc tgaatatcga cggtttccat 4200 
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atggggattg gtggcgacga ctcctggagc 
gccggtcgct accattacca gttggtctgg 
atcgataccg tcgaaacttg tttattgcag 
tcacaaattt cacaaataaa gcattttttt 
tcatcaatgt atcttatcat gtctggatcc 
atgagacccg caccaggtgc agaccctgcg 
ctgtgatgct ggatgtgacc gaggagctga 
gcgctgagtt tggctctagc gatgaagata 
gcttaagggt gggaaagaat atataaggtg 
cagcagccgc cgccgccatg agcaccaact 
tgacaacgcg catgccccca tgggccgggg 
atggtcgccc cgtcctgccc gcaaactcta 
cgccgttgga gactgcagcc tccgccgccg 
ttgtgactga ctttgctttc ctgagcccgc 
cccgcgatga caagttgacg gctcttttgg 
atgtcgtttc tcagcagctg ttggatctgc 
cccctcccaa tgcggtttaa aacataaata 
agcaagtgtc ttgctgtctt tatttagggg 
ggtctcggtc gttgagggtc ctgtgtattt 
tgttcagata catgggcata agcccgtctc 
catgctgcgg ggtggtgttg tagatgatcc 
taaaaatgtc tttcagtagc aagctgattg 
caaagcggtt aagctgggat gggtgcatac 
tttttaggtt ggctatgttc ccagccatat 
ccagcacagt gtatccggtg cacttgggaa 
ggaagaactt ggagacgccc ttgtgacctc 
tggcaatggg cccacgggcg gcggcctggg 
agttgtgttc aggatgagat cgtcataggc 
agactgcggt ataatggttc catccggccc 
ttcccacgct ttgagttcag atggggggat 
ggtttccggg gtaggggaga tcagctggga 
accgcagccg gtgggcccgt aaatcacacc 
gctgcagctg ccgtcatccc tg a g ca 9ggg 
catgttttcc ctgaccaaat ccgccagaag 
caaggaagca aagtttttca acggtttgag 
ttgaccaagc agttccaggc ggtcccacag 
cagcatatct cctcgtttcg cgggttgggg 
tcgtccagac gggccagggt catgtctttc 
tgggtcacgg tgaaggggtg cgctccgggc 
gtcctgctgg tgctgaagcg ctgccggtct 
accatggtgt catagtccag cccctccgcg 
gaggaggcgc cgcacgaggg gcagtgcaga 
aataccgatt ccggggagta ggcatccgcg 
acgagccagg tgagctctgg ccgttcgggg 
atgcgtttct tacctctggt ttccatgagc 
tccgtgtccc cgtatacaga cttgagaggc 
tcgtatagaa actcggacca ctctgagaca 
gctaagtggg aggggtagcg gtcgttgtcc 
agacacatgt cgccctcttc ggcatcaagg 
tgaccgggtg ttcctgaagg ggggctataa 
tcttccgcat cgctgtctgc gagggccagc 
ggcatgactt ctgcgctaag attgtcagtt 
tggcccgcgg tgatgccttt gagggtggcc 
ttgttgtcaa gcttggtggc aaacgacccg 
gagcgcaggg tttggttttt gtcgcgatcg 
acgtattcgc gcgcaacgca ccgccattcg 
aggtgcacgc gccaaccgcg gttgtgcagg 
ccgcgtaggc gctcgttggt ccagcagagg 
agggggtcta gctgcgtctc gtccgggggg 
aggcgcgcgt cgaagtagtc tatcttgcat 
cgggcggcaa gcgcgcgctc gtatgggttg 
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ccgtcagtat cggcggaatt tcagctgagc 4260 
tgtcaaaaat aataatctcg aatcaagctt 432 0 
cttataatgg ttacaaataa agcaatagca 4 380 
cactgcattc tagttgtggt ttgtccaaac 4440 
gacctcggat ctggaaggtg ctgaggtacg 4500 
agtgtggcgg taaacatatt aggaaccagc 4560 
ggcccgatca cttggtgctg gcctgcaccc 4 62 0 
cagattgagg tactgaaatg tgtgggcgtg 468 0 
ggggtcttat gtagttttgt atctgttttg 4740 
cgtttgatgg aagcattgtg agctcatatt 4 8 00 
tgcgtcagaa tgtgatgggc tccagcattg 4 860 
ctaccttgac ctacgagacc gtgtctggaa 4 920 
cttcagccgc tgcagccacc gcccgcggga 4 98 0 
ttgcaagcag tgcagcttcc cgttcatccg 5 040 
cacaattgga ttctttgacc cgggaactta 5100 
gccagcaggt ttctgccctg aaggcttcct 5160 
aaaaaccaga ctctgtttgg atttggatca 5220 
ttttgcgcgc gcggtaggcc cgggaccagc 5280 
tttccaggac gtggtaaagg tgactctgga 5340 
tggggtggag gtagcaccac tgcagagctt 5400 
agtcgtagca ggagcgctgg gcgtggtgcc 5460 
ccaggggcag gcccttggtg taagtgttta 5520 
gtggggatat gagatgcatc ttggactgta 5580 
ccctccgggg attcatgttg tgcagaacca 5640 
atttgtcatg tagcttagaa ggaaatgcgt 5700 
caagattttc catgcattcg tccataatga 5760 
cgaagatatt tctgggatca ctaacgtcat 5 820 
catttttaca aagcgcgggc ggagggtgcc 5880 
a ggggcgtag ttaccctcac agatttgcat 5940 
catgtctacc tgcggggcga tgaagaaaac 6000 
agaaagcagg ttcctgagca gctgcgactt 6060 
tattaccggg tgcaactggt agttaagaga 6120 
ggccacttcg ttaagcatgt ccctgactcg 6180 
gcgctcgccg cccagcgata gcagttcttg 6240 
accgtccgcc gtaggcatgc ttttgagcgt 6300 
ctcggtcacc tgctctacgg catctcgatc 6360 
cggctttcgc tgtacggcag tagtcggtgc 6420 
cacgggcgca gggtcctcgt cagcgtagtc 6480 
tgcgcgctgg ccagggtgcg cttgaggctg 6540 
tcgccctgcg cgtcggccag gtagcatttg 6600 
gcgtggccct tggcgcgcag cttgcccttg 6660 
cttttgaggg cgtagagctt gggcgcgaga 672 0 
ccgcaggccc cgcagacggt ctcgcattcc 6780 
tcaaaaacca ggtttccccc atgctttttg 6840 
cggtgtccac gctcggtgac gaaaaggctg 6900 
ctgtcctcg*.; gcggtgttcc gcggtcctcc 6960 
aaggctcgcg tccaggccag cacgaaggag 7020 
actagggggt ccactcgctc cagggtgtga 7080 
aaggtgattg gtttgtaggt gtaggccacg 7140 
aagggggtgg gggcgcgttc gtcctcactc 7200 
tgttggggtg agtactccct ctgaaaagcg 7260 
tccaaaaacg aggaggattt gatattcacc 7320 
gcatccatct ggtcagaaaa gacaatcttt 7380 
tagagggcgt tggacagcaa cttggcgatg 7440 
gcgcgctcct tggccgcgat gtttagctgc 7500 
ggaaagacgg tggtgcgctc gtcgggcacc 7560 
gtgacaaggt caacgctggt ggctacctct 7620 
cggccgccct tgcgcgagca gaatggcggt 7680 
tctgcgtcca cggtaaagac cccgggcagc 7740 
ccttgcaagt ctagcgcctg ctgccatgcg 7800 
agtgggggac cccatggcat ggggtgggtg 7 860 
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agcgcggagg cgtacatgcc gcaaatgtcg 
agatatgtag ggtagcatct tccaccgcgg 
tgcgagggag cgaggaggtc gggaccgagg 
actatctgcc tgaagatggc atgtgagttg 
aagctggcgt ctgtgagacc taccgcgtca 
ttgttgacca gctcggcggt gacctgcacg 
atgatgtcat acttatcctg tccctttttt 
tcgcggtctt tccagtactc ttggatcgga 
agcatgtaga actggttgac ggcctggtag 
tatgcctgcg cggccttccg gagcgaggtg 
actttgaggt actggtattt gaagtcagtg 
aagtccgtgc gctttttgga acgcggattt 
atctttcccg cgcgaggcat aaagttgcgt 
cggttgttaa ttacctgggc ggcgagcacg 
acaatgtaaa gttccaagaa gcgcgggatg 
tcgtaggtga gctcttcagg ggagctgagc 
tgagggttgg aagcgacgaa tgagctccac 
tcgcgaaagg tcctaaactg gcgacctatg 
gtaagcgggt cttgttccca gcggtcccat 
gtcactagag gctcatctcc gccgaacttc 
ccaaaggccc ccatccaagt ataggtctct 
cgaggatgcg agccgatcgg gaagaactgg 
ttgatgtggt gaaagtagaa gtccctgcga 
aaacgtgcgc agtactggca gcggtgcacg 
cgaccgcgca caaggaagca gagtgggaat 
tggtcttcta cttcggctgc ttgtccttga 
gatcggacca ccacgccgcg cgagcccaaa 
ttgatgacaa catcgcgcag atgggagctg 
tcaggcggga gctcctgcag gtttacctcg 
aggtgatacc taatttccag gggctggttg 
catccccgcg gcgcgactac ggtaccgcgc 
gatgatgcat ctaaaagcgg tgacgcgggc 
ccgccgggag agggggcagg ggcacgtcgg 
cgcgtaggtt gctggcgaac gcgacgacgc 
gcgtgaagac gacgggcccg gtgagcttga 
cggtgtcgtt gacggcggcc tggcgcaaaa 
aggcgatctc ggccatgaac tgctcgatct 
gctccacggt ggcggcgagg tcgttggaaa 
ggcctccctc gttccagacg cggctgtaga 
tgaccacctg cgcgagattg agctccacgt 
gctgaaagag gtagttgagg gtggtggcgg 
agcgtcgcaa cgtggattcg ttgatatccc 
agaagtccac ggcgaagttg aaaaactggg 
ccagaagacg gatgagctcg gcgacagtgt 
cctcttcttc ttcttcaatc tcctcttcca 
gcggtggggg aggggggaca cggcggcgac 
gctcgatcat ctccccgcgg cgacggcgca 
9999gcgcag ttggaagacg ccgcccgtca 
catgcggcag ggatacggcg ctaacgatgc 
cgccgaggga cctgagcgag tccgcatcga 
ctaaccagtc acagtcgcaa ggtaggctga 
c 9999ttgtt tctggcggag gtgctgctga 
99 c 99 at 99 fc cgacagaagc accatgtcct 
cggccatgcc ccaggcttcg ttttgacatc 
gcctttctac cggcacttct tcttctcctt 
ctgcggcggc ggcggagttt ggccgtaggt 
cgaagcccct catcggctga agcagggcta 
cctgctgcac ctgcgtgagg gtagactgga 
cgcccgtgtt gatggtgtaa gtgcagttgg 
ccggctgcga gagctcggtg tacctgagac 
cgttgcaagt ccgcaccagg tactggtatc 
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taaacgtaga ggggctctct gagtattcca 792 0 
atgctggcgc gcacgtaatc gtatagttcg 7980 
ttgctacggg cgggctgctc tgctcggaag 8040 
gatgatatgg ttggacgctg gaagacgttg 8100 
cgcacgaagg aggcgtagga gtcgcgcagc 8160 
tctagggcgc agtagtccag ggtttccttg 8220 
ttccacagct cgcggttgag gacaaactct 8280 
aacccgtcgg cctccgaacg gtaagagcct 8340 
gcgcagcatc ccttttctac gggtagcgcg 84 00 
tgggtgagcg caaaggtgtc cctgaccatg 8460 
tcgtcgcatc cgccctgctc ccagagcaaa 852 0 
ggcagggcga aggtgacatc gttgaagagt 8580 
gtgatgcgga agggtcccgg cacctcggaa 8640 
atctcgtcaa agccgttgat gttgtggccc 8700 
cccttgatgg aaggcaattt tttaagttcc 8760 
ccgtgctctg aaagggccca gtctgcaaga 882 0 
aggtcacggg ccattagcat ttgcaggtgg 8880 
gccatttttt ctggggtgat gcagtagaag 8940 
ccaaggttcg cggctaggtc tcgcgcggca 9000 
atgaccagca tgaagggcac gagctgcttc 9060 
acatcgtagg tgacaaagag acgctcggtg 9120 
atctcccgcc accaattgga ggagtggcta 9180 
cgggccgaac actcgtgctg gcttttgtaa 9240 
ggctgtacat cctgcacgag gttgacctga 9300 
ttgagcccct cgcctggcgg gtttggctgg 9360 
ccgtctggct gctcgagggg agttacggtg 9420 
gtccagatgt ccgcgcgcgg cggtcggagc 9480 
tccatggtct ggagctcccg cggcgtcagg 9540 
catagacggg tcagggcgcg ggctagatcc 9600 
gtggcggcgt cgatggcttg caagaggccg 9660 
ggcgggcggt gggccgcggg ggtgtccttg 9720 
gagcccccgg aggtaggggg ggctccggac 9780 
cgccgcgcgc gggcaggagc tggtgctgcg 9840 
ggcggttgat ctcctgaatc tggcgcctct 9900 
gcctgaaaga gagttcgaca gaatcaattt 9960 
tctcctgcac gtctcctgag ttgtcttgat 1002 0 
cttcctcctg gagatctccg cgtccggctc 10080 
tgcgggccat gagctgcgag aaggcgttga 10140 
ccacgccccc ttcggcatcg cgggcgcgca 10200 
gccgggcgaa gacggcgtag tttcgcaggc 10260 
tgtgttctgc cacgaagaag tacataaccc 10320 
ccaaggcctc aaggcgctcc atggcctcgt 10380 
agttgcgcgc cgacacggtt aactcctcct 1044 0 
cgcgcacctc gcgctcaaag gctacagggg 10500 
taagggcctc cccttcttct tcttctggcg 10560 
gacggcgcac cgggaggcgg tcgacaaagc 10620 
tggtctcggt gacggcgcgg ccgttctcgc 10680 
tgtcccggtt atgggttggc ggggggctgc 10740 
atctcaacaa ttgttgtgta ggtactccgc 10800 
ccggatcgga aaacctctcg agaaaggcgt 10860 
gcaccgtggc gggcggcagc gggcggcggt 10920 
tgatgtaatt aaagtaggcg gtcttgagac 10980 
tgggtccggc ctgctgaatg cgcagg[cggt 1104 0 
ggcgcaggtc tttgtagtag tcttgcatga 11100 
cctcttgtcc tgcatctctt gcatctatcg 11160 
ggcgccctct tcctcccatg cgtgtgaccc 11220 
ggtcggcgac aacgcgctcg gctaatatgg 11280 
agtcatccat gtccacaaag cggtggtatg 1134 0 
ccataacgga ccagttaacg gtctggtgac 11400 
gcgagtaagc cctcgagtca aatacgtagt 11460 
ccaccaaaaa gtgcggcggc ggctggcggt 11520 
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agaggggcca gcgtagggtg gccggggctc 
gatatccgta gatgtacctg gacatccagg 
gaaagtcgcg gacgcggttc cagatgttgc 
cgctctggcc ggtcaggcgc gcgcaatcgt 
tgtaagcggg cactcttccg tggtctggtg 
accggggttc gagccccgta tccggccgtc 
gtcgaaccca ggtgtgcgac gtcagacaac 
gcgcggcggc tgctgcgcta gcttttttgg 
gctggaaagc gaaagcatta agtggctcgc 
gttgagtcgc gggacccccg gttcgagtct 
ttgcctcccc gtcatgcaag accccgcttg 
ttttttgctt ttcccagatg catccggtgc 
ggcaagagca agagcagcgg cagacatgca 
gaggggcgac atccgcggtt gacgcggcag 
gggcccggca ctacctggac ttggaggagg 
ctcctgagcg gtacccaagg gtgcagctga 
ggcagaacct gtttcgcgac cgcgagggag 
tccacgcagg gcgcgagctg cggcatggcc 
actttgagcc cgacgcgcga accgggatta 
acctggtaac cgcatacgag cagacggtga 
acaaccacgt gcgtacgctt gtggcgcgcg 
gggactttgt aagcgcgctg gagcaaaacc 
tccttatagt gcagcacagc agggacaacg 
tagagcccga gggccgctgg ctgctcgatt 
aggagcgcag cttgagcctg gctgacaagg 
tgggcaagtt ttacgcccgc aagatatacc 
taaagatcga ggggttctac atgcgcatgg 
tgggcgttta tcgcaacgag cgcatccaca 
tcagcgaccg cgagctgatg cacagcctgc 
atagagaggc cgagtcctac tttgacgcgg 
gcgccctgga ggcagctggg gccggacctg 
acgtcggcgg cgtggaggaa tatgacgagg 
actaagcggt gatgtttctg atcagatgat 
ggcgctgcag agccagccgt ccggccttaa 
ccgcatcatg tcgctgactg cgcgcaatcc 
ccggctctcc gcaattctgg aagcggtggt 
ggtgctggcg atcgtaaacg cgctggccga 
cctggtctac gacgcgctgc ttcagcgcgt 
caacctggac cggctggtgg gggatgtgcg 
gcagcagggc aacctgggct ccatggttgc 
caacgtgccg cggggacagg aggactacac 
gactgagaca ccgcaaagtg aggtgtacca 
tagacaaggc ctgcagaccg taaacctgag 
gggggtgcgg gctcccacag gcgaccgcgc 
gcgcctgttg ctgctgctaa tagcgccctt 
atacctaggt cacttgctga cactgtaccg 
gcatactttc caggagatta caagtgtcag 
cctggaggca accctaaact acctgctgac 
cagtttaaac agcgaggagg agcgcatttt 
cctgatgcgc gacggggtaa cgcccagcgt 
accgggcatg tatgcctcaa accggccgtt 
cgcggccgcc gtgaaccccg agtatttcac 
gccccctggt ttctacaccg ggggattcga 
ggacgacata gacgacagcg tgttttcccc 
gcgcgagcag gcagaggcgg cgctgcgaaa 
cgatctaggc gctgcggccc cgcggtcaga 
gtctcttacc agcactcgca ccacccgccc 
caactcgctg ctgcagccgc agcgcgaaaa 
gatagagagc ctagtggaca agatgagtag 
cgtgccaggc ccgcgcccgc ccacccgtcg 
gtgggaggac gatgactcgg cagacgacag 
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c 99gggcgag atcttccaac ataaggcgat 1158 0 
tgatgccggc ggcggtggtg gaggcgcgcg 11640 
gcagcggcaa aaagtgctcc atggtcggga 11700 
tgacgctcta gaccgtgcaa aaggagagcc 11760 
gataaattcg caagggtatc atggcggacg 1182 0 
cgccgtgatc catgcggtta ccgcccgcgt 11880 
gggggagtgc tccttttggc ttccttccag 1194 0 
ccactggccg cgcgcagcgt aagcggttag 12000 
tccctgtagc cggagggtta ttttccaagg 12 060 
cggaccggcc ggactgcggc gaacgggggt 1212 0 
caaattcctc cggaaacagg gacgagcccc 12180 
tgcggcagat gcgcccccct cctcagcagc 1224 0 
gggcaccctc ccctcctcct accgcgtcag 123 00 
cagatggtga ttacgaaccc ccgcggcgcc 12360 
gcgagggcct ggcgcggcta ggagcgccct 1242 0 
agcgtgatac gcgtgaggcg tacgtgccgc 124 80 
aggagcccga ggagatgcgg gatcgaaagt 1254 0 
tgaatcgcga gcggttgctg cgcgaggagg 126 0 0 
gtcccgcgcg cgcacacgtg gcggccgccg 1266 0 
accaggagat taactttcaa aaaagcttta 1272 0 
aggaggtggc tataggactg atgcatctgt 12780 
caaatagcaa gccgctcatg gcgcagctgt 12 84 0 
aggcattcag ggatgcgctg ctaaacatag 1290 0 
tgataaacat cctgcagagc atagtggtgc 12 960 
tggccgccat caactattcc atgcttagcc 13 02 0 
atacccctta cgttcccata gacaaggagg 13 080 
cgctgaaggt gcttaccttg agcgacgacc 1314 0 
aggccgtgag cgtgagccgg cggcgcgagc 132 00 
aaagggccct ggctggcacg ggcagcggcg 132 60 
gcgctgacct gcgctgggcc ccaagccgac 13 32 0 
ggctggcggt ggcacccgcg cgcgctggca 13380 
acgatgagta cgagccagag gacggcgagt 1344 0 
gcaagacgca acggacccgg cggtgcgggc 1350 0 
ctccacggac gactggcgcc aggtcatgga 13560 
tgacgcgttc cggcagcagc cgcaggccaa 13 62 0 
cccggcgcgc gcaaacccca cgcacgagaa 13680 
aaacagggcc atccggcccg acgaggccgg 13 74 0 
ggctcgttac aacagcggca acgtgcagac 13 800 
cgaggccgtg gcgcagcgtg agcgcgcgca 13 860 
actaaacgcc ttcctgagta cacagcccgc 13 92 0 
caactttgtg agcgcactgc ggctaatggt 13 98 0 
gtctgggcca gactattttt tccagaccag 14 04 0 
ccaggctttc aaaaacttgc aggggctgtg 14100 
gaccgtgtct agcttgctga cgcccaactc 14160 
cacggacagt ggcagcgtgt cccgggacac 14220 
cgaggccata ggtcaggcgc atgtggacga 142 80 
ccgcgcgctg gggcaggagg acacgggcag 14340 
caaccggcgg cagaagatcc cctcgttgca 144 0 0 
gcgctacgtg cagcagagcg tgagccttaa 144 60 
ggcgctggac atgaccgcgc gcaacatgga 14520 
tatcaaccgc ctaatggact acttgcatcg 14580 
caatgccatc ttgaacccgc actggctacc 14640 
ggtgcccgag ggtaacgatg gattcctctg 14700 
gcaaccgcag accctgctag agttgcaaca 14760 
ggaaagcttc cgcaggccaa gcagcttgtc 14 82 0 
tgctagtagc ccatttccaa gcttgatagg 14880 
gcgcctgctg ggcgaggagg agtacctaaa 14 94 0 
aaacctgcct ccggcatttc ccaacaacgg 15000 
atggaagacg tacgcgcagg agcacaggga 15060 
tcaaaggcac gaccgtcagc ggggtctggt 15120 
cagcgtcctg gatttgggag ggagtggcaa 15180 
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cccgtttgcg caccttcgcc ccaggctggg gagaatgttt taaaaaaaaa aaagcatgat 15240 
gcaaaataaa aaactcacca aggccatggc accgagcgtt ggttttcttg tattcccctt 1530 0 
agtatgcggc gcgcggcgat gtatgaggaa ggtcctcctc cctcctacga gagtgtggtg 15360 
agcgcggcgc cagtggcggc ggcgctgggt tctcccttcg atgctcccct ggacccgccg 15420 
tttgtgcctc cgcggtacct gcggcctacc ggggggagaa acagcatccg ttactctgag 15480 
ttggcacccc tattcgacac cacccgtgtg tacctggtgg acaacaagtc aacggatgtg 1554 0 
gcatccctga actaccagaa cgaccacagc aactttctga ccacggtcat tcaaaacaat 1560 0 
gactacagcc cgggggaggc aagcacacag accatcaatc ttgacgaccg gtcgcactgg 156 6 0 
ggcggcgacc tgaaaaccat cctgcatacc aacatgccaa atgtgaacga gttcatgttt 1572 0 
accaataagt ttaaggcgcg ggtgatggtg tcgcgcttgc ctactaagga caatcaggtg 15780 
gagctgaaat acgagtgggt ggagttcacg ctgcccgagg gcaactactc cgagaccatg 15840 
accatagacc ttatgaacaa cgcgatcgtg gagcactact tgaaagtggg cagacagaac 15900 
ggggttctgg aaagcgacat cggggtaaag tttgacaccc gcaacttcag actggggttt 15960 
gaccccgtca ctggtcttgt catgcctggg gtatatacaa acgaagcctt ccatccagac 1602 0 
atcattttgc tgccaggatg cggggtggac ttcacccaca gccgcctgag caacttgttg 16080 ■ 
ggcatccgca agcggcaacc cttccaggag ggctttagga tcacctacga tgatctggag 1614 0 
ggtggtaaca ttcccgcact gttggatgtg gacgcctacc aggcgagctt gaaagatgac 162 0 0 
accgaacagg gcgggggtgg cgcaggcggc agcaacagca gtggcagcgg cgcggaagag 16260 ' 
aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaacga tcatgccatt 1632 0 
cgcggcgaca cctttgccac acgggctgag gagaagcgcg ctgaggccga agcagcggcc 163 80 
gaagctgccg cccccgctgc gcaacccgag gtcgagaagc ctcagaagaa accggtgatc 16440 
aaacccctga cagaggacag caagaaacgc agttacaacc taataagcaa tgacagcacc 16500 
ttcacccagt accgcagctg gtaccttgca tacaactacg gcgaccctca gaccggaatc 16560 
cgctcatgga ccctgctttg cactcctgac gtaacctgcg gctcggagca ggtctactgg 16 62 0 
tcgttgccag acatgatgca agaccccgtg accttccgct ccacgcgcca gatcagcaac 16680 
tttccggtgg tgggcgccga gctgttgccc gtgcactcca agagcttcta caacgaccag 16740 
gccgtctact cccaactcat ccgccagttt acctctctga cccacgtgtt caatcgcttt 16800 
cccgagaacc agattttggc gcgcccgcca gcccccacca tcaccaccgt cagtgaaaac 16860 
gttcctgctc tcacagatca cgggacgcta ccgctgcgca acagcatcgg aggagtccag 16 92 0 
cgagtgacca ttactgacgc cagacgccgc acctgcccct acgtttacaa ggccctgggc 16980 
atagtctcgc cgcgcgtcct atcgagccgc actttttgag caagcatgtc catccttata 17 04 0 
tcgcccagca ataacacagg ctggggcctg cgcttcccaa gcaagatgtt tggcggggcc 17100 
aagaagcgct ccgaccaaca cccagtgcgc gtgcgcgggc actaccgcgc gccctggggc 17160 
gcgcacaaac gcggccgcac tgggcgcacc accgtcgatg acgccatcga cgcggtggtg 1722 0 
gaggaggcgc gcaactacac gcccacgccg ccaccagtgt ccacagtgga cgcggccatt i72 8 0 
cagaccgtgg tgcgcggagc ccggcgctat gctaaaatga agagacggcg gaggcgcgta 1734 0 
gcacgtcgcc accgccgccg acccggcact gccgcccaac gcgcggcggc ggccctgctt 174 0 0 
aaccgcgcac gtcgcaccgg ccgacgggcg gccatgcggg ccgctcgaag gctggccgcg 174 60 
ggtattgtca ctgtgccccc caggtccagg cgacgagcgg ccgccgcagc agccgcggcc 1752 0 
attagtgcta tgactcaggg tcgcaggggc aacgtgtatt gggtgcgcga ctcggttagc 17580 
ggcctgcgcg tgcccgtgcg cacccgcccc ccgcgcaact agattgcaag aaaaaactac 17640 
ttagactcgt actgttgtat gtatccagcg gcggcggcgc gcaacgaagc tatgtccaag 177 0 0 
cgcaaaatca aagaagagat gctccaggtc atcgcgccgg agatctatgg ccccccgaag 17760 
aaggaagagc aggattacaa gccccgaaag ctaaagcggg tcaaaaagaa aaagaaagat 1782 0 
gatgatgatg aacttgacga cgaggtggaa ctgctgcacg ctaccgcgcc caggcgacgg 17880 
Otacagtgga aaggtcgacg cgtaaaacgt gttttgcgac ccggcaccac cgtagtcttt 17940 
acgcccggtg agcgctccac ccgcacctac aagcgcgtgt atgatgaggt gtacggcgac 18 000 
gaggacctgc ttgagcaggc caacgagcgc ctcggggagt ttgcctacgg aaagcggcat 18060 
aaggacatgc tggcgttgcc gctggacgag ggcaacccaa cacctagcct aaagcccgta 18120 
acactgcagc aggtgctgcc cgcgcttgca ccgtccgaag aaaagcgcgg cctaaagcgc 18180 
gagtctggtg acttggcacc caccgtgcag ctgatggtac ccaagcgcca gcgactggaa 18240 
gatgtcttgg aaaaaatgac cgtggaacct gggctggagc ccgaggtccg cgtgcggcca 183 00 
atcaagcagg tggcgccggg actgggcgtg cagaccgtgg acgttcagat acccactacc 183 60 
agtagcacca gtattgccac cgccacagag ggcatggaga cacaaacgtc cccggttgcc 1842 0 
tcagcggtgg cggatgccgc ggtgcaggcg gtcgctgcgg ccgcgtccaa gacctctacg 18480 
gaggtgcaaa cggacccgtg gatgtttcgc gtttcagccc cccggcgccc gcgcggttcg 18540 
aggaagtacg gcgccgccag cgcgctactg cccgaatatg ccctacatcc ttccattgcg 186 0 0 
cctacccccg gctatcgtgg ctacacctac cgccccagaa gacgagcaac tacccgacgc 18660 
cgaaccacca ctggaacccg ccgccgccgt cgccgtcgcc agcccgtgct ggccccgatt 1872 0 
tccgtgcgca gggtggctcg cgaaggaggc aggaccctgg tgctgccaac agcgcgctac 18780 
caccccagca tcgtttaaaa gccggtcttt gtggttcttg cagatatggc cctcacctgc 18 840 
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cgcctccgtt tcccggtgcc gggattccga ggaagaatgc accgtaggag gggcatggcc 18900 
ggccacggcc tgacgggcgg catgcgtcgt gcgcaccacc ggcggcggcg cgcgtcgcac 18960 
cgtcgcatgc gcggcggtat cctgcccctc cttattccac tgatcgccgc ggcgattggc 1902 0 
gccgtgcccg gaattgcatc cgtggccttg caggcgcaga gacactgatt aaaaacaagt 19080 
tgcatgtgga aaaatcaaaa taaaaagtct ggactctcac gctcgcttgg tcctgtaact 1914 0 
attttgtaga atggaagaca tcaactttgc gtctctggcc ccgcgacacg gctcgcgccc 1920 0 
gttcatggga aactggcaag atatcggcac cagcaatatg agcggtggcg ccttcagctg 19260 
gggctcgctg tggagcggca ttaaaaattt cggttccacc gttaagaact atggcagcaa 1932 0 
ggcctggaac agcagcacag gccagatgct gagggataag ttgaaagagc aaaatttcca 1938 0 
acaaaaggtg gtagatggcc tggcctctgg cattagcggg gtggtggacc tggccaacca 1944 0 
ggcagtgcaa aataagatta acagtaagct tgatccccgc cctcccgtag aggagcctcc 19500 
accggccgtg gagacagtgt ctccagaggg gcgtggcgaa aagcgtccgc gccccgacag 1956 0 
ggaagaaact ctggtgacgc aaatagacga gcctccctcg tacgaggagg cactaaagca 1962 0 
a ggcctgccc accacccgtc ccatcgcgcc catggctacc ggagtgctgg gccagcacac 19680 
acccgtaacg ctggacctgc ctccccccgc cgacacccag cagaaacctg tgctgccagg 19740 
cccgaccgcc gttgttgtaa cccgtcctag ccgcgcgtcc ctgcgccgcg ccgccagcgg 1980 0 
tccgcgatcg ttgcggcccg tagccagtgg caactggcaa agcacactga acagcatcgt 19860 
gggtctgggg gtgcaatccc tgaagcgccg acgatgcttc tgaatagcta acgtgtcgta 1992 0 
tgtgtgtcat gtatgcgtcc atgtcgccgc cagaggagct gctgagccgc cgcgcgcccg 19980 
ctttccaaga tggctacccc ttcgatgatg ccgcagtggt cttacatgca catctcgggc 20040 
caggacgcct cggagtacct gagccccggg ctggtgcagt ttgcccgcgc caccgagacg 20100 
tacttcagcc tgaataacaa gtttagaaac cccacggtgg cgcctacgca cgacgtgacc 2 0160 
acagaccggt cccagcgttt gacgctgcgg ttcatccctg tggaccgtga ggatactgcg 2022 0 
tactcgtaca aggcgcggtt caccctagct gtgggtgata accgtgtgct ggacatggct 20280 
tccacgtact ttgacatccg cggcgtgctg gacaggggcc ctacttttaa gccctactct 2 034 0 
ggcactgcct acaacgccct ggctcccaag ggtgccccaa atccttgcga atgggatgaa 20400 
gctgctactg ctcttgaaat aaacctagaa gaagaggacg atgacaacga agacgaagta 2 0460 
gacgagcaag ctgagcagca aaaaactcac gtatttgggc aggcgcctta ttctggtata 2 0520 
aatattacaa aggagggtat tcaaataggt gtcgaaggtc aaacacctaa atatgccgat 20580 
aaaacatttc aacctgaacc tcaaatagga gaatctcagt ggtacgaaac tgaaattaat 20640 
catgcagctg ggagagtcct taaaaagact accccaatga aaccatgtta cggttcatat 2 0700 
gcaaaaccca caaatgaaaa tggagggcaa ggcattcttg taaagcaaca aaatggaaag 20760 
ctagaaagtc aagtggaaat gcaatttttc tcaactactg aggcgaccgc aggcaatggt 2 0820 
gataacttga ctcctaaagt ggtattgtac agtgaagatg tagatataga aaccccagac 20 880 
actcatattt cttacatgcc cactattaag gaaggtaact cacgagaact aatgggccaa 2 0 940 
caatctatgc ccaacaggcc taattacatt gcttttaggg acaattttat tggtctaatg 21000 
tattacaaca gcacgggtaa tatgggtgtt ctggcgggcc aagcatcgca gttgaatgct 21060 
gttgtagatt tgcaagacag aaacacagag ctttcatacc agcttttgct tgattccatt 21120 
ggtgatagaa ccaggtactt ttctatgtgg aatcaggctg ttgacagcta tgatccagat 21180 
gttagaatta ttgaaaatca tggaactgaa gatgaacttc caaattactg ctttccactg 21240 
99 a 99Jtgtga ttaatacaga gactcttacc aaggtaaaac ctaaaacagg tcaggaaaat 21300 
ggatgggaaa aagatgctac agaattttca gataaaaatg aaataagagt tggaaataat 213 60 
tttgccatgg aaatcaatct aaatgccaac ctgtggagaa atttcctgta ctccaacata 21420 
gcgctgtatt tgcccgacaa gctaaagtac agtccttcca acgtaaaaat ttctgataac 21480 
ccaaacacct acgactacat gaacaagcga gtggtggctc ccgggttagt ggactgctac 2154 0 
attaaccttg gagcacgctg gtcccttgac tatatggaca acgtcaaccc atttaaccac 21600 
caccgcaatg ctggcctgcg ctaccgctca atgttgctgg gcaatggtcg ctatgtgccc 21660 
ttccacatcc aggtgcctca gaagttcttt gccattaaaa acctccttct cctgccgggc 21720 
tcatacacct acgagtggaa cttcaggaag gatgttaaca tggttctgca gagctcccta 21780 
ggaaatgacc taagggttga cggagccagc attaagtttg atagcatttg cctttacgcc 21840 
accttcttcc ccatggccca caacaccgcc tccacgcttg aggccatgct tagaaacgac 21900 
accaacgacc agtcctttaa cgactatctc tccgccgcca acatgctcta ccctataccc 21960 
gccaacgcta ccaacgtgcc catatccatc ccctcccgca actgggcggc tttccgcggc 22020 
tgggccttca cgcgccttaa gactaaggaa accccatcac tgggctcggg ctacgaccct 22080 
tattacacct actctggctc tataccctac ctagatggaa ccttttacct caaccacacc 22140 
tttaagaagg tggccattac ctttgactct tctgtcagct ggcctggcaa tgaccgcctg 22200 
cttaccccca acgagtttga aattaagcgc tcagttgacg gggagggtta caacgttgcc 222 60 
cagtgtaaca tgaccaaaga ctggttcctg gtacaaatgc tagctaacta caacattggc 22320 
taccagggct tctatatccc agagagctac aaggaccgca tgtactcctt ctttagaaac 223 80 
ttccagccca tgagccgtca ggtggtggat gatactaaat acaaggacta ccaacaggtg 22440 
ggcatcctac accaacacaa caactctgga tttgttggct accttgcccc caccatgcgc 22500 
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gaaggacagg cctaccctgc taacttcccc tatccgctta taggcaagac cgcagttgac 22560 
agcattaccc agaaaaagtt tctttgcgat cgcacccttt ggcgcatccc attctccagt 22620 
aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 22680 
gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 22740 
gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 22800 
accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 22 860 
catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 22920 
atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 22980 
ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 2 3040 
actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 23100 
gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 23160 
gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 2322 0 
tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 23280 
ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 23340 
ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 23400 
tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 23460 
gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 23520 
ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 23580 
tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 23640 
acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca 23700 
gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 23760 
gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 2382 0 
ca gggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 23880 
tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 2394 0 
ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 24 000 
gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 24060 
ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 24120 
acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 24180 
tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 2424 0 
actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 24300 
ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc 24360 
ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 2442 0 
actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 24480 
aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 24540 
cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 24600 
tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 24660 
tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 24720 
gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 24780 
ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 24840 
catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 24900 
tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 24960 
atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 25020 
cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 25080 
gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 25140 
cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 25200 
aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 25260 
ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 25320 
ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 253 8 0 
accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 25440 
aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 25500 
tgttgaagca tctgcagcgc cagtgcgccal ttatctgcga cgcgttgcaa gagcgcagcg 25560 
atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 2562 0 
gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 25680 
accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 25740 
agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 25800 
agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 25860 
ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 25920 
gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 25980 
gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 26040 
gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 26100 
tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 26160 
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ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc 2622 0 
tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca 2 6280 
agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga 2 6340 
tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc 26400 
ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg 26460 
actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt 26520 
gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc 26580 
tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg 26640 
aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgtrtgc 26700 
agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac 26760 
ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact 2682 0 
gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga 26880 
gcggtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc 26940 
tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg 27000 
gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga 27060 
cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct 27120 
acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc 27180 
acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg 27240 
9acggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc 27300 
cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag 2 7360 
ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag 27420 
gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct 27480 
tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg 27540 
gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg 27600 
ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt 27660 
aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca 27720 
tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc 27780 
ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat 27840 
tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc 2 7900 
agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc 27960 
cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc 28020 
gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag 2 8080 
gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg 2 8140 
cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt 28200 
cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc 28260 
gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat 28320 
tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc 28380 
ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat 2844 0 
gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc 28500 
tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta 2 8560 
ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca 2 8620 
gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg 28680 
gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt 28740 
gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc 2 880 0 
ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc 28860 
tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc 28920 
cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa 28980 
ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa 29040 
acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta 29100 
ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca 29160 
gggagagctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg 2 922 0 
ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca 29280 
agatctttgt tgccatctct gtgctgagta taataaatac agaaattaaa atatactggg 29340 
gctcctatcg ccatcctgta aacgccaccg tcttcacccg cccaagcaaa ccaaggcgaa 29400 
ccttacctgg tacttttaac atctctccct ctgtgattta caacagtttc aacccagacg 29460 
gagtgagtct acgagagaac ctctccgagc tcagctactc catcagaaaa aacaccaccc 2 952 0 
tccttacctg ccgggaacgt acgagtgcgt caccggccgc tgcaccacac ctaccgcctg 29580 
accgtaaacc agactttttc cggacagacc tcaataactc tgtttaccag aacaggaggt 2 964 0 
gagcttagaa aacccttagg gtattaggcc aaaggcgcag ctactgtggg gtttatgaac 29700 
aattcaagca actctacggg ctattctaat tcaggtttct ctagaaatgg acggaattat 29760 
tacagagcag cgcctgctag aaagacgcag ggcagcggcc gagcaacagc gcatgaatca 29820 
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agagctccaa gacatggtta acttgcacca gtgcaaaagg ggtatctttt gtctggtaaa 29880 
gcaggccaaa gtcacctacg acagtaatac caccggacac cgccttagct acaagttgcc 2 994 0 
aaccaagcgt cagaaattgg tggtcatggt gggagaaaag cccattacca taactcagca 30000 
ctcggtagaa accgaaggct gcattcactc accttgtcaa ggacctgagg atctctgcac 30060 
ccttattaag accctgtgcg gtctcaaaga tcttattccc tttaactaat aaaaaaaaat 30120 
aataaagcat cacttactta aaatcagtta gcaaatttct gtccagttta ttcagcagca 30180 
cctccttgcc ctcctcccag ctctggtatt gcagcttcct cctggctgca aactttctcc 30240 
acaatctaaa tggaatgtca gtttcctcct gttcctgtcc atccgcaccc actatcttca 30300 
tgttgttgca gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 30360 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 30420 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 30480 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 3 054 0 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 30600 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 30660 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 30720 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 30780 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 30840 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 30900 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 3 0960 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 31020 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 31080 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 31140 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 31200 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 31260 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 31320 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 31380 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 31440 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 31500 
ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgcagaga 31560 
) aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 31620 
cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 31680 
ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 31740 
attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 31800 
ttatgcctaa cctatcagct tatccaaaat ctcacggtaa aactgccaaa agtaacattg 31860 
tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 31920 
acggtacaca ggaaacagga gacacaactc caagtgcata ctctatgtca ttttcatggg 31980 
actggtctgg ccacaactac attaatgaaa tatttgccac atcctcttac actttttcat 3204 0 
acattgccca agaataaaga atcgtttgtg ttatgtttca acgtgtttat ttttcaattg 32100 
cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca tagcttatac 32160 
agatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc acctccctcc 32220 
caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat catatcatgg 322 80 
gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc caaacgctca 32340 
tcagtgatat taataaactc cccgggcagc tcacttaagt tcatgtcgct gtccagctgc 3240 0 
tgagccacag gctgctgtcc aacttgcggt tgcttaacgg gcggcgaagg agaagtccac 32460 
gcctacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg ctgcagcagc 3252 0 
gcgcgaatcia actgctgccg ccgccgctcc gtcctgcagg aatacaacat ggcagtggtc 32580 
tcctcagcga tgattcgcac cgcccgcagc ataaggcgcc ttgtcctccg ggcacagcag 3 2 640 
cgcaccctga tctcacttaa atcagcacag taactgcagc acagcaccac aatattgttc 32700 
aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac agaacccacg 32760 
tggccatcat accacaagcg caggtagatt aagtggcgac ccctcataaa cacgctggac 32820 
ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca tataaacctc 32880 
tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggccaaaac ctgcccgccg 32 94 0 
gctatacact gcagggaacc gggactggaa caatgacagt ggagagccca ggactcgtaa 33 00 0 
ccatggatca tcatgctcgt catgatatca atgttggcac aacacaggca cacgtgcata 33060 
cacttcctca ggattacaag ctcctcccgc gttagaacca tatcccaggg aacaacccat 33120 
tcctgaatca gcgtaaatcc cacactgcag ggaagacctc gcacgtaact cacgttgtgc 33180 
attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt agcgcgggtt 33240 
tctgtctcaa aaggaggtag acgatcccta ctgtacggag tgcgccgaga caaccgagat 33300 
cgtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt tcctgaagca 33360 
aaaccaggtg cgggcgtgac aaacagatct gcgtctccgg tctcgccgct tagatcgctc 3342 0 
tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc tggcttcggg 33480 
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ttctatgtaa 
cacacccagc 
tggaagaacc 
ctattaagtg 
taatggcatt 
agtggacgta 
caaccatgcc 
gaatattaag 
agcagcgaat 
cggaacatta 
atcgtgcagg 
acccacactg 
agctttgttg 
cctcgcgcaa 
ccggaaccac 
taaacacaaa 
aaaaacaacc 
ctggtcaccg 
gtaagactcg 
agcccggggg 
taacaaaatt 
aaatagcacc 
tcagccttac 
tcaatcagtc 
atgacgtaac 
agaaacgaaa 
acgtcacttc 
cctaaaacct 
cctcattatc 



actccttcat 
caacctacac 
atgttttttt 
aacgcgctcc 
tgtaagatgt 
aaggctaaac 
caaataattc 
tccggccatt 
catgattgca 
acaaaaatac 
tctgcacgga 
attatgacac 
catgggcggc 
aaaagaaagc 
cacagaaaaa 
ataaaataac 
cttataagca 
tgattaaaaa 
gtaaacacat 
aatacatacc 
aataggagag 
ctcccgctcc 
cagtaaaaaa 
acagtgtaaa 
ggttaaagtc 
gccaaaaaac 
ccattttaat 
acgtcacccg 
atattggctt 



gcgccgctgc 
attcgttctg 
ttttattcca 
cctccggtgg 
tgcacaatgg 
ccttcagggt 
tcatctcgcc 
gtaaaaatct 
aaaattcagg 
cgcgatcccg 
ccagcgcggc 
gcatactcgg 
gatataaaat 
acatcgtagt 
gacaccattt 
aaaaaaacat 
taagacggac 
gcaccaccga 
caggttgatt 
cgcaggcgta 
aaaaacacat 
agaacaacat 
gaaaacchat 
aaagggccaa 
cacaaaaaac 
ccacaacttc 
taagaaaact 
ccccgttccc 
caatccaaaa 
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catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 

agtgaaatct 

gactttgacc 

acgcgtatct 

gttaggagtc 

cttatgcaat 

aaggagagaa 

tattaggaag 

cagagatatt 

cattggtgtg 

ctgttgggct 

aacccgtcgg 

cggatcggaa 

gatcccccgg 

gtagaagacc 

gtcgttttac 

gcacatcccc 

caacagttgc 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
gtttacgtgg 
gaggggacta 
ccctcaggat 
actcttgtag 
aaagcaccgt 
gcaacagacg 
gtatttaagt 
cacctccggc 
cgcggttgag 
cctccgaacg 
aacctctcga 
gctgcaggaa 
ccaaggactt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
agactcgccc 
gggtgtgttt 
atagtagttt 
tcttgcaaca 
gcatgccgat 
ggtctgacat 
gcctagctcg 
cctggccact 
gacaaactct 
gtactccgcc 
gaaaggcgtg 
ttcgatctag 
tccttcagaa 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 



cctgataaca 
cgagtcacac 
aaagattatc 
cgtggtcaaa 
cttccaaaag 
gaatctcctc 
accttctcaa 
gctccagagc 
ttcctcacag 
taggtccctt 
cacttccccg 
agctatgcta 
gcaaggtgct 
catgctcatg 
ttctctcaaa 
ttaaacatta 
tacggccatg 
cagctcctcg 
catcggtcag 
gagacaacat 
aaacacctga 
acagcgcttc 
taaaaaaaca 
gtgcagagcg 
acccagaaaa 
ctcaaatcgt 
acaattccca 
acgccccgcg 
taaggtatat 



tccaccaccg 
acgggaggag 
caaaacctca 
ctctacagcc 
gcaaacggcc 
tataaacatt 
tatatctcta 
gccctccacc 
acctgtataa 
cgcagggcca 
ccaggaacct 
accagcgtag 
gctcaaaaaa 
cagataaagg 
catgtctgcg 
gaagcctgtc 
ccggcgtgac 
gtcatgtccg 
tgctaaaaag 
tacagccccc 
aaaaccctcc 
cacagcggca 
ccactcgaca 
agtatatata 
ccgcacgcga 
cacttccgtt 
acacatacaa 
ccacgtcaca 
tattgatgat 



cagaataagc 
cgggaagagc 
aaatgaagat 
aaagaacaga 
ctcacgtcca 
ccagcacctt 
agcaaatccc 
ttcagcctca 
gattcaaaag 
gctgaacata 
tgacaaaaga 
ccccgatgta 
tcaggcaaag 
caggtaagct 
ggtttctgca 
ttacaacagg 
cgtaaaaaaa 
gagtcataat 
cgaccgaaat 
ataggaggta 
tgcctaggca 
gccataacag 
cggcaccagc 
ggactaaaaa 
acctacgccc 
ttcccacgtt 
gttactccgc 
aactccaccc 
9 



33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35211 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
agggcgcgcc 
aggcgaaaag 
cgcttttgca 
tggtaacgat 
tggtggaagt 
ggattggacg 
atacaataaa 
ctcttccgca 
tcgcggtctt 
gccgagggac 
taaccagtca 
atggataaag 
ttgctaagtt 
cctggcgtta 
agcgaagagg 
cgctttgcct 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tatttgtcta 
ccgatgtacg 
cggggcttcg 
tagggagggg 
gagttagcaa 
aaggtggtac 
aaccactgaa 
cgccatttga 
tcgctgtctg 
tccagtactc 
ctgagcgagt 
cagtcgctct 
gtccaaaaaa 
ttttgagtga 
cccaacttaa 
cccgcaccga 
ggtttccggc 



ggggtggagt 
gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
gggccgcggg 
ggccagatat 
gttgtacgcg 
gaaatgtagt 
catgccttac 
gatcgtgcct 
ttccgcattg 
ccattcacca 
cgggggccag 
ttggatcgga 
ccgcatcgac 
agaactagtg 
gaagagaaag 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
accagaagcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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gtgccggaaa gctggctgga gtgcgatctt 
aactggcaga tgcacggtta cgatgcgccc 
gtcaatccgc cgtttgttcc cacggagaat 
gttgatgaaa gctggctaca ggaaggccag 
gcgtttcatc tgtggtgcaa cgggcgctgg 
tctgaatttg acctgagcgc atttttacgc 
ctgcgttgga gtgacggcag ttatctggaa 
ttccgtgacg tctcgttgct gcataaaccg 
actcgcttta atgatgattt cagccgcgct 
gagttgcgtg actacctacg ggtaacagtt 
agcggcaccg cgcctttcgg cggtgaaatt 
gtcacactac gtctgaacgt cgaaaacccg 
tatcgtgcgg tggttgaact gcacaccgcc 
gatgtcggtt tccgcgaggt gcggattgaa 
ttgctgattc gaggcgttaa ccgtcacgag 
gagcagacga tggtgcagga tatcctgctg 
tgttcgcatt atccgaacca tccgctgtgg 
gtggtggatg aagccaatat tgaaacccac 
gatccgcgct ggctaccggc gatgagcgaa 
aatcacccga gtgtgatcat ctggtcgctg 
gacgcgctgt atcgctggat caaatctgtc 
ggcggagccg acaccacggc caccgatatt 
gaccagccct tcccggctgt gccgaaatgg 
gagacgcgcc cgctgatcct ttgcgaatac 
ttcgctaaat actggcaggc gtttcgtcag 
gactgggtgg atcagtcgct gattaaatat 
..ggcggtgatt ttggcgatac gccgaacgat 
.gccgaccgca cgccgcatcc agcgctgacg 
ttccgtttat ccgggcaaac catcgaagtg 
aacgagctcc tgcactggat ggtggcgctg 
cctctggatg tcgctccaca aggtaaacag 
gagagcgccg ggcaactctg gctcacagta 
tcagaagccg ggcacatcag cgcctggcag 
acgctccccg ccgcgtccca cgccatcccg 
atcgagctgg gtaataagcg ttggcaattt 
tggattggcg ataaaaaaca actgctgacg 
ctggataacg acattggcgt aagtgaagcg 
cgctggaagg cggcgggcca ttaccaggcc 
acacttgctg atgcggtgct gattacgacc 
ttatttatca gccggaaaac ctaccggatt 
gatgttgaag tggcgagcga tacaccgcat 
gcgcaggtag cagagcgggt aaactggctc 
cgccttactg ccgcctgttt tgaccgctgg 
tacgtcttcc cgagcgaaaa cggtctgcgc 
caccagtggc gcggcgactt ccagttcaac 
gaaaccagcc atcgccatct gctgcacgcg 
ttccatatgg ggattggtgg cgacgactcc 
ctgagcgccg gtcgctacca ttaccagttg 
aagcttatcg ataccgtcga aacttgttta 
atagcatcac aaatttcaca aataaagcat 
ccaaactcat caatgtatct tatcatgtct 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
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cctgaggccg atactgtcgt cgtcccctca 1380 
atctacacca acgtaaccta tcccattacg 1440 
ccgacgggtt gttactcgct cacatttaat 1500 
acgcgaatta tttttgatgg cgttaactcg 1560 
gtcggttacg gccaggacag tcgtttgccg 162 0 
gccggagaaa accgcctcgc ggtgatggtg 1680 
gatcaggata tgtggcggat gagcggcatt 1740 
actacacaaa tcagcgattt ccatgttgcc 1800 
gtactggagg ctgaagttca gatgtgcggc 1860 
tctttatggc agggtgaaac gcaggtcgcc 1920 
atcgatgagc gtggtggtta tgccgatcgc 1980 
aaactgtgga gcgccgaaat cccgaatctc 2040 
gacggcacgc tgattgaagc agaagcctgc 2100 
aatggtctgc tgctgctgaa cggcaagccg 216 0 
catcatcctc tgcatggtca ggtcatggat 222 0 
atgaagcaga acaactttaa cgccgtgcgc 2280 
tacacgctgt gcgaccgcta cggcctgtat 2340 
ggcatggtgc caatgaatcg tctgaccgat 2400 
cgcgtaacgc gaatggtgca gcgcgatcgt 2460 
gggaatgaat caggccacgg cgctaatcac 2520 
gatccttccc gcccggtgca gtatgaaggc 2580 
atttgcccga tgtacgcgcg cgtggatgaa 2640 
tccatcaaaa aatggctttc gctacctgga 270 0 
gcccacgcga tgggtaacag tcttggcggt 2760 
tatccccgtt tacagggcgg cttcgtctgg 2820 
gatgaaaacg gcaacccgtg gtcggcttac 2 880 
cgccagttct gtatgaacgg tctggtcttt 2940 
gaagcaaaac accagcagca gtttttccag 3000 
accagcgaat acctgttccg tcatagcgat 3 060 
gatggtaagc cgctggcaag cggtgaagtg 3120 
ttgattgaac tgcctgaact accgcagccg 3180 
cgcgtagtgc aaccgaacgc gaccgcatgg 3240 
cagtggcgtc tggcggaaaa cctcagtgtg 3300 
catctgacca ccagcgaaat ggatttttgc 3360 
aaccgccagt caggctttct ttcacagatg 3420 
ccgctgcgcg atcagttcac ccgtgcaccg 3480 
acccgcattg accctaacgc ctgggtcgaa 3540 
gaagcagcgt tgttgcagtg cacggcagat 3600 
gctcacgcgt ggcagcatca ggggaaaacc 3 660 
gatggtagtg gtcaaatggc gattaccgtt 3720 
ccggcgcgga ttggcctgaa ctgccagctg 37 80 
ggattagggc cgcaagaaaa ctatcccgac 3840 
gatctgccat tgtcagacat gtataccccg 3900 
tgcgggacgc gcgaattgaa ttatggccca 3 960 
atcagccgct acagtcaaca gcaactgatg 4020 
gaagaaggca catggctgaa tatcgacggt 40 80 
tggagcccgt cagtatcggc ggaatttcag 414 0 
gtctggtgtc aaaaataata atctcgaatc 42 0 0 
ttgcagctta taatggttac aaataaagca 4260 
ttttttcact gcattctagt tgtggtttgt 432 0 
ggatccgacc tcggatctgg aaggtgctga 43 8 0 
cctgcgagtg tggcggtaaa catattagga 444 0 
agctgaggcc cgatcacttg gtgctggcct 45 00 
aagatacaga ttgaggtact gaaatgtgtg 45 60 
aaggtggggg tcttatgtag ttttgtatct 462 0 
ccaactcgtt tgatggaagc attgtgagct 46 8 0 
ccggggtgcg tcagaatgtg atgggctcca 474 0 
actctactac cttgacctac gagaccgtgt 48 0 0 
ccgccgcttc agccgctgca gccaccgccc 4860 
gcccgcttgc aagcagtgca gcttcccgtt 4 92 0 
ttttggcaca attggattct ttgacccggg 498 0 
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aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 5040 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 5100 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 5160 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 5220 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 5280 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 5340 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 54 0 0 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 5460 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 5520 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 558 0 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 564 0 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 570 0 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 5760 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 5820 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 5880 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 594 0 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccggctgc aactggtagt 60 0 0 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 6060 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 6120 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 6180 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 6240 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg cfcttcgctgt acggcagtag 63 0 0 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 6360 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 6420 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 6480 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 6540 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 6600 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 6660 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 6720 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 6780 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 6840 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 6900 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 6960 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 7020 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 7080 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 7140 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 72 00 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 7260 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 7320 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 73 80 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 7440 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 7500 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 7560 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 7620 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 7680 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 7740 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 7800 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 7860 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 7920 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 7980 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 8040 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 8100 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 8160 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 8220 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 82 80 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 8340 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 84 00 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 8460 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 8520 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 85 80 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 864 0 
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aagttcctcg 
tgcaagatga 
caggtggtcg 
gtagaaggta 
cgcggcagtc 
ctgcttccca 
ctcggtgcga 
gtggctattg 
tttgtaaaaa 
gacctgacga 
tggctggtgg 
tacggtggat. 
tcggagcttg 
cgtcaggtca 
tagatccagg 
gaggccgcat 
gtccttggat 
tccggacccg 
tgctgcgcgc 
cgcctctgcg 
tcaatttcgg 
tcttgatagg 
ccggctcgct 
gcgttgaggc 
gcgcgcatga 
cgcaggcgct 
ataacccagc 
gcctcgtaga 
tcctcctcca 
acaggggcct 
tctggcggcg 
acaaagcgct 
ttctcgcggg 
gggctgccat 
actccgccgc 
aaggcgtcta 
cggcggtcgg 
ttgagacggc 
aggcggtcgg 
tgcatgagcc 
tctatcgctg 
gtgaccccga 
aatatggcct 
tggtatgcgc 
tggtgacccg 
acgtagtcgt 
tggcggtaga 
aggcgatgat 
gcgcgcggaa 
gtcgggacgc 
gagagcctgt 
gcggacgacc 
cccgcgtgtc 
cttccaggcg 
cggttaggct 
tccaagggtt 
cgggggtttg 

gagccccttt 
cagcagcggc 
gcgtcaggag 
c 99cgccggg 



taggtgagct 
gggttggaag 
cgaaaggtcc 
agcgggtctt 
actagaggct 
aaggccccca 
ggatgcgagc 
atgtggtgaa 
cgtgcgcagt 
ccgcgcacaa 
tcttctactt 
cggaccacca 
atgacaacat 
ggcgggagct 
tgatacctaa 
ccccgcggcg 
gatgcatcta 
ccgggagagg 
gtaggttgct 
tgaagacgac 
tgtcgttgac 
cgatctcggc 
ccacggtggc 
ctccctcgtt 
ccacctgcgc 
gaaagaggta 
gtcgcaacgt 
agtccacggc 
gaagacggat 
cttcttcttc 
gtgggggagg 
cgatcatctc 
ggcgcagttg 
gcggcaggga 
cgagggacct 
accagtcaca 
ggttgtttct 
ggatggtcga 
ccatgcccca 
tttctaccgg 
cggcggcggc 
agcccctcat 
gctgcacctg 
ccgtgttgat 
gctgcgagag 
tgcaagtccg 
ggggccagcg 
atccgtagat 
agtcgcggac 
tctggccggt 
aagcgggcac 

ggggttcgag 

gaacccaggt 
cggcggctgc 
ggaaagcgaa 
gagtcgcggg 
cctccccgtc 
tttgcttttc 
aagagcaaga 
gggcgacatc 
cccggcacta 



cttcagggga 
cgacgaatga 
taaactggcg 
gttcccagcg 
catctccgcc 
tccaagtata 
cgatcgggaa 
agtagaagtc 
actggcagcg 
ggaagcagag 
cggctgcttg 
cgccgcgcga 
cgcgcagatg 
cctgcaggtt 
tttccagggg 
cgactacggt 
aaagcggtga 
gggcaggggc 
ggcgaacgcg 
gggcccggtg 
ggcggcctgg 
catgaactgc 
ggcgaggtcg 
ccagacgcgg 
gagattgagc 
gttgagggtg 
ggattcgttg 
gaagttgaaa 
gagctcggcg 
ttcaatctcc 
ggggacacgg 
cccgcggcga 
gaagacgccg 
tacggcgcta 
gagcgagt cc 
gtcgcaaggt 
ggcggaggtg 
cagaagcacc 
ggcttcgttt 
cacttcttct 
ggagtttggc 
cggctgaagc 
cgtgagggta 
ggtgtaagtg 
ctcggtgtac 
caccaggtac 
tagggtggcc 
gtacctggac 
gcggttccag 
caggcgcgcg 
tcttccgtgg 
ccccgtatcc 
gtgcgacgtc 
tgcgctagct 
agcattaagt 
acccccggtt 
atgcaagacc 
ccagatgcat 
gcagcggcag 
cgcggttgac 
cctggacttg 



gctgagcccg 
gctccacagg 
acctatggcc 
gtcccatcca 
gaacttcatg 
ggtctctaca 
gaactggatc 
cctgcgacgg 
gtgcacgggc 
tgggaatttg 
tccttgaccg 
gcccaaagtc 
ggagctgtcc 
tacctcgcat 
ctggttggtg 
accgcgcggc 
cgcgggcgag 
acgtcggcgc 
acgacgcggc 
agcttgagcc 
cgcaaaatct 
tcgatctctt 
ttggaaatgc 
ctgtagacca 
tccacgtgcc 
gtggcggtgt 
atatccccca 
aactgggagt 
acagtgtcgc 
tcttccataa 
cggcgacgac 
cggcgcatgg 
cccgtcatgt 
acgatgcatc 
gcatcgaccg 
aggctgagca 
ctgctgatga 
atgtccttgg 
tgacatcggc 
tctccttcct 
cgtaggtggc 
agggctaggt 
gactggaagt 
cagttggcca 
ctgagacgcg 
tggtatccca 

ggggctccgg 

atccaggtga 
atgttgcgca 
caatcgttga 
tctggtggat 
ggccgtccgc 
agacaacggg 
tttttggcca 
ggctcgctcc 
cgagtctcgg 
ccgcttgcaa 
ccggtgctgc 
acatgcaggg 
gcggcagcag 
gaggagggcg 



tgctctgaaa 
tcacgggcca 
attttttctg 
aggttcgcgg 
accagcatga 
tcgtaggtga 
tcccgccacc 
gccgaacact 
tgtacatcct 
agcccctcgc 
tctggctgct 
cagatgtccg 
atggtctgga 
agacgggtca 
gcggcgtcga 
gggcggtggg 
cccccggagg 
cgcgcgcggg 
ggttgatctc 
tgaaagagag 
cctgcacgtc 
cctcctggag 
gggccatgag 
cgcccccttc 
gggcgaagac 
gttctgccac 
aggcctcaag 
tgcgcgccga 
gcacctcgcg 
gggcctcccc 
ggcgcaccgg 
tctcggtgac 
cccggttatg 
tcaacaattg 
gatcggaaaa 
ccgtggcggg 
tgtaattaaa 
gtccggcctg 
gcaggtcttt 
cttgtcctgc 
gccctcttcc 
cggcgacaac 
catccatgtc 
taacggacca 
agtaagccct 
ccaaaaagtg 
gggcgagatc 
tgccggcggc 
gcggcaaaaa 
cgctctagac 
aaattcgcaa 
cgtgatccat 
ggagtgctcc 
ctggccgcgc 
ctgtagccgg 
accggccgga 
attcctccgg 
ggcagatgcg 
caccctcccc 
atggtgatta 
agggcctggc 



gggcccagtc 
ttagcatttg 
gggtgatgca 
ctaggtctcg 
agggcacgag 
caaagagacg 
aattggagga 
cgtgctggct 
gcacgaggtt 
ctggcgggtt 
cgaggggagt 
cgcgcggcgg 
gctcccgcgg 
gggcgcgggc 
tggcttgcaa 
ccgcgggggt 
tagggggggc 
caggagctgg 
ctgaatctgg 
ttcgacagaa 
tcctgagttg 
atctccgcgt 
ctgcgagaag 
ggcatcgcgg 
ggcg;tagttt 
gaagaagtac 
gcgctccatg 
cacggttaac 
ctcaaaggct 
ttcttcttct 
gaggcggtcg 
ggcgcggccg 
ggttggcggg 
ttgtgtaggt 
cctctcgaga 
cggcagcggg 
gtaggcggtc 
ctgaatgcgc 
gtagtagtct 
atctcttgca 
tcccatgcgt 
gcgctcggct 
cacaaagcgg 
gttaacggtc 
cgagtcaaat 
cggcggcggc 
ttccaacata 

ggtggtggag 

gtgctccatg 
cgtgcaaaag 
gggtatcatg 
gcggttaccg 
ttttggcttc 
gcagcgtaag 
agggttattt 
ctgcggcgaa 
aaacagggac 
cccccctcct 
tcctcctacc 
cgaacccccg 
gcggctagga 
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gcgccctctc ctgagcggta cccaagggtg cagctgaagc gtgatacgcg tgaggcgtac 123 60 
gtgccgcggc agaacctgtt tcgcgaccgc gagggagagg agcccgagga gatgcgggat 12420 
cgaaagttcc acgcagggcg cgagctgcgg catggcctga atcgcgagcg gttgctgcgc 124 80 
gaggaggact ttgagcccga cgcgcgaacc gggattagtc ccgcgcgcgc acacgtggcg 1254 0 
gccgccgacc tggtaaccgc atacgagcag acggtgaacc aggagattaa ctttcaaaaa 12600 
agctttaaca accacgtgcg tacgcttgtg gcgcgcgagg aggtggctat aggactgatg 1266 0 
catctgtggg actttgtaag cgcgctggag caaaacccaa atagcaagcc gctcatggcg 1272 0 
cagctgttcc ttatagtgca gcacagcagg gacaacgagg cattcaggga tgcgctgcta 12780 
aacatagtag agcccgaggg ccgctggctg ctcgatttga taaacatcct gcagagcata 12 84 0 
gtggtgcagg agcgcagctt gagcctggct gacaaggtgg ccgccatcaa ctattccatg 12900 
cttagcctgg gcaagtttta cgcccgcaag atataccata ccccttacgt tcccatagac 12960 
aaggaggtaa agatcgaggg gttctacatg cgcatggcgc tgaaggtgct taccttgagc 13 02 0 
gacgacctgg gcgtttatcg caacgagcgc atccacaagg ccgtgagcgt gagccggcgg 13080 
cgcgagctca gcgaccgcga gctgatgcac agcctgcaaa gggccctggc tggcacgggc 13140 
agcggcgata gagaggccga gtcctacttt gacgcgggcg ctgacctgcg ctgggcccca 13200 
agccgacgcg ccctggaggc agctggggcc ggacctgggc tggcggtggc acccgcgcgc 13260 
gctggcaacg tcggcggcgt ggaggaatat gacgaggacg atgagtacga gccagaggac 13320 
ggcgagtact aagcggtgat gtttctgatc agatgatgca agacgcaacg gacccggcgg 133 80 
tgcgggcggc gctgcagagc cagccgtccg gccttaactc cacggacgac tggcgccagg 13440 
tcatggaccg catcatgtcg ctgactgcgc gcaatcctga cgcgttccgg cagcagccgc 13500 
aggccaaccg gctctccgca attctggaag cggtggtccc ggcgcgcgca aaccccacgc 13560 
acgagaaggt gctggcgatc gtaaacgcgc tggccgaaaa cagggccatc cggcccgacg 13620 
aggccggcct ggtctacgac gcgctgcttc agcgcgtggc tcgttacaac agcggcaacg 13680 
tgcagaccaa cctggaccgg ctggtggggg atgtgcgcga ggccgtggcg cagcgtgagc 13740 
gcgcgcagca gcagggcaac ctgggctcca tggttgcact aaacgccttc ctgagtacac 13 800 
agcccgccaa cgtgccgcgg ggacaggagg actacaccaa ctttgtgagc gcactgcggc 13 860 
taatggtgac tgagacaccg caaagtgagg tgtaccagtc tgggccagac tattttttcc 13920 
agaccagtag acaaggcctg cagaccgtaa acctgagcca ggctttcaaa aacttgcagg 13980 
ggctgtgggg ggtgcgggct cccacaggcg accgcgcgac cgtgtctagc ttgctgacgc 1404 0 
ccaactcgcg cctgttgctg ctgctaatag cgcccttcac ggacagtggc agcgtgtccc 14100 
gggacacata cctaggtcac ttgctgacac tgtaccgcga ggccataggt caggcgcatg 14160 
tggacgagca tactttccag gagattacaa gtgtcagccg cgcgctgggg caggaggaca 1422 0 
cgggcagcct ggaggcaacc ctaaactacc tgctgaccaa ccggcggcag aagatcccct 142 80 
cgttgcacag tttaaacagc gaggaggagc gcafctttgcg ctacgtgcag cagagcgtga 14340 
gccttaacct gatgcgcgac ggggtaacgc ccagcgtggc gctggacatg accgcgcgca 14400 
acatggaacc gggcatgtat gcctcaaacc ggccgtttat caaccgccta atggactact 14460 
tgcatcgcgc ggccgccgtg aaccccgagt atttcaccaa tgccatcttg aacccgcact 14520 
ggctaccgcc ccctggtttc tacaccgggg gattcgaggt gcccgagggt aacgatggat 14580 
tcctctggga cgacatagac gacagcgtgt tttccccgca accgcagacc ctgctagagt 1464 0 
tgcaacagcg cgagcaggca gaggcggcgc tgcgaaagga aagcttccgc aggccaagca 14700 
gcttgtccga tctaggcgct gcggccccgc ggtcagatgc tagtagccca tttccaagct 14760 
tgatagggtc tcttaccagc actcgcacca cccgcccgcg cctgctgggc gaggaggagt 14820 
acctaaacaa ctcgctgctg cagccgcagc gcgaaaaaaa cctgcctccg gcatttccca 14 880 
acaacgggat agagagccta gtggacaaga tgagtagatg gaagacgtac gcgcaggagc 14940 
acagggacgt gccaggcccg cgcccgccca cccgtcgtca aaggcacgac cgtcagcggg 150 0 0 
gtctggtgtg ggaggacgat gactcggcag acgacagcag cgtcctggat ttgggaggga 15060 
gtggcaaccc gtttgcgcac cttcgcccca ggctggggag aatgttttaa aaaaaaaaaa 15120 
gcatgatgca aaataaaaaa ctcaccaagg ccatggcacc gagcgttggt tttcttgtat 15180 
tccccttagt atgcggcgcg cggcgatgta tgaggaaggt cctcctccct cctacgagag 15240 
tgtggtgagc gcggcgccag tggcggcggc gctgggttct cccttcgatg ctcccctgga 153 00 
cccgccgttt gtgcctccgc ggtacctgcg gcctaccggg gggagaaaca gcatccgtta 15360 
ctctgagttg gcacccctat tcgacaccac ccgtgtgtac ctggtggaca acaagtcaac 1542 0 
ggatgtggca tccctgaact accagaacga ccacagcaac tttctgacca cggtcattca 154 80 
aaacaatgac tacagcccgg gggaggcaag cacacagacc atcaatcttg acgaccggtc 15540 
gcactggggc ggcgacctga aaaccatcct gcataccaac atgccaaatg tgaacgagtt 15600 
catgtttacc aataagttta aggcgcgggt gatggtgtcg cgcttgccta ctaaggacaa 15660 
tcaggtggag ctgaaatacg agtgggtgga gttcacgctg cccgagggca actactccga 1572 0 
gaccatgacc atagacctta tgaacaacgc gatcgtggag cactacttga aagtgggcag 15780 
acagaacggg gttctggaaa gcgacatcgg ggtaaagttt gacacccgca acttcagact 15 840 
ggggtttgac cccgtcactg gtcttgtcat gcctggggta tatacaaacg aagccttcca 15900 
tccagacatc attttgctgc caggatgcgg ggtggacttc acccacagcc gcctgagcaa 15960 
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cttgttgggc atccgcaagc ggcaaccctt ccaggagggc tttaggatca cctacgatga 16020 
tctggagggt ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa 16080 
agatgacacc gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc 16140 
ggaagagaac tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatca 16200 
tgccattcgc ggcgacacct ttgccacacg ggctgaggag aagcgcgctg aggccgaagc 16260 
agcggccgaa gctgccgccc ccgctgcgca acccgaggtc gagaagcctc agaagaaacc 16320 
ggtgatcaaa cccctgacag aggacagcaa gaaacgcagt tacaacctaa taagcaatga 16380 
cagcaccttc acccagtacc gcagctggta ccttgcatac aactacggcg accctcagac 16440 
cggaatccgc tcatggaccc tgctttgcac tcctgacgta acctgcggct cggagcaggt 16500 
ctactggtcg ttgccagaca tgatgcaaga ccccgtgacc ttccgctcca cgcgccagat 16560 
cagcaacttt ccggtggtgg gcgccgagct gttgcccgtg cactccaaga gcttctacaa 16620 
cgaccaggcc gtctactccc aactcatccg ccagtttacc tctctgaccc acgtgttcaa 16680 
tcgctttccc gagaaccaga ttttggcgcg cccgccagcc cccaccatca ccaccgtcag 1674 0 
tgaaaacgtt cctgctctca cagatcacgg gacgctaccg ctgcgcaaca gcatcggagg 16800 
agtccagcga gtgaccatta ctgacgccag acgccgcacc tgcccctacg tttacaaggc 16860 
cctgggcata gtctcgccgc gcgtcctatc gagccgcact ttttgagcaa gcatgtccat 16920 
ccttatatcg cccagcaata acacaggctg gggcctgcgc ttcccaagca agatgtttgg 16980 
cggggccaag aagcgctccg accaacaccc agtgcgcgtg cgcgggcact accgcgcgcc 17040 
ctggggcgcg cacaaacgcg gccgcactgg gcgcaccacc gtcgatgacg ccatcgacgc 17100 
ggtggtggag gaggcgcgca actacacgcc cacgccgcca ccagtgtcca cagtggacgc 17160 
ggccattcag accgtggtgc gcggagcccg gcgctatgct aaaatgaaga gacggcggag 17220 
gcgcgtagca cgtcgccacc gccgccgacc cggcactgcc gcccaacgcg cggcggcggc 1728 0 
cctgcttaac cgcgcacgtc gcaccggccg acgggcggcc atgcgggccg ctcgaaggct 17340 
ggccgcgggt attgtcactg tgccccccag gtccaggcga cgagcggccg ccgcagcagc 17400 
cgcggccatt agtgctatga ctcagggtcg caggggcaac gtgtattggg tgcgcgactc 17460 
ggttagcggc ctgcgcgtgc ccgtgcgcac ccgccccccg cgcaactaga ttgcaagaaa 17520 
aaactactta gactcgtact gttgtatgta tccagcggcg gcggcgcgca acgaagctat 17580 
gtccaagcgc aaaatcaaag aagagatgct ccaggtcatc gcgccggaga tctatggccc 17640 
cccgaagaag gaagagcagg attacaagcc ccgaaagcta aagcgggtca aaaagaaaaa 17700 
:gaaagatgat gatgatgaac ttgacgacga ggtggaactg ctgcacgcta ccgcgcccag 17760 
gcgacgggta cagtggaaag gtcgacgcgt aaaacgtgtt ttgcgacccg gcaccaccgt 17820 
agtctttacg cccggtgagc gctccacccg cacctacaag cgcgtgtatg atgaggtgta 17880 
cggcgacgag gacctgcttg agcaggccaa cgagcgcctc ggggagtttg cctacggaaa 17940 
gcggcataag gacatgctgg cgttgccgct ggacgagggc aacccaacac ctagcctaaa 18000 
gcccgtaaca ctgcagcagg tgctgcccgc gcttgcaccg tccgaagaaa agcgcggcct 18060 
aaagcgcgag tctggtgact tggcacccac cgtgcagctg atggtaccca agcgccagcg 1812 0 
actggaagat gtcttggaaa aaatgaccgt ggaacctggg ctggagcccg aggtccgcgt 18180 
gcggccaatc aagcaggtgg cgccgggact gggcgtgcag accgtggacg ttcagatacc 1824 0 
cactaccagt agcaccagta ttgccaccgc cacagagggc atggagacac aaacgtcccc 18300 
ggttgcctca gcggtggcgg atgccgcggt gcaggcggtc gctgcggccg cgtccaagac 18360 
ctctacggag gtgcaaacgg acccgtggat gtttcgcgtt tcagcccccc ggcgcccgcg 1842 0 
cggttcgagg aagtacggcg ccgccagcgc gctactgccc gaatatgccc tacatccttc 18480 
cattgcgcct acccccggct atcgtggcta cacctaccgc cccagaagac gagcaactac 18540 
ccgacgccga accaccactg gaacccgccg ccgccgtcgc cgtcgccagc ccgtgctggc 18600 
cccgatttcc gtgcgcaggg tggctcgcga aggaggcagg accctggtgc tgccaacagc 186 6 0 
gcgctaccac cccagcatcg tttaaaagcc ggtctttgtg gttcttgcag atatggccct 1872 0 
cacctgccgc ctccgtttcc cggtgccggg attccgagga agaatgcacc gtaggagggg 18780 
catggccggc cacggcctga cgggcggcat gcgtcgtgcg caccaccggc ggcggcgcgc 1884 0 
gtcgcaccgt cgcatgcgcg gcggtatcct gcccctcctt attccactga tcgccgcggc 1890 0 
gattggcgcc gtgcccggaa ttgcatccgt ggccttgcag gcgcagagac actgattaaa 18960 
aacaagttgc atgtggaaaa atcaaaataa aaagtctgga ctctcacgct cgcttggtcc 1902 0 
tgtaactatt ttgtagaatg gaagacatca actttgcgtc tctggccccg cgacacggct 19080 
cgcgcccgtt catgggaaac tggcaagata tcggcaccag caatatgagc ggtggcgcct 1914 0 
tcagctgggg ctcgctgtgg agcggcatta aaaatttcgg ttccaccgtt aagaactatg 192 0 0 
gcagcaaggc ctggaacagc agcacaggcc agatgctgag ggataagttg aaagagcaaa 192 60 
atttccaaca aaaggtggta gatggcctgg cctctggcat tagcggggtg gtggacctgg 1932 0 
ccaaccaggc agtgcaaaat aagattaaca gtaagcttga tccccgccct cccgtagagg 193 80 
agcctccacc ggccgtggag acagtgtctc cagaggggcg tggcgaaaag cgtccgcgcc 1944 0 
ccgacaggga agaaactctg gtgacgcaaa tagacgagcc tccctcgtac gaggaggcac 19500 
taaagcaagg cctgcccacc acccgtccca tcgcgcccat ggctaccgga gtgctgggcc 19560 
agcacacacc cgtaacgctg gacctgcctc cccccgccga cacccagcag aaacctgtgc 19620 
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tgccaggccc 
ccagcggtcc 
gcatcgtggg 
tgtcgtatgt 
gcgcccgctt 
ctcgggccag 
cgagacgtac 
cgtgaccaca 
tactgcgtac 
catggcttcc 
ctactctggc 
ggatgaagct 
cgaagtagac 
tggtataaat 
tgccgataaa 
aattaatcat 
ttcatatgca 
tggaaagcta 
caatggtgat 
cccagacact 
gggccaacaa 
tctaatgtat 
gaatgctgtt 
ttccattggt 
tccagatgtt 
tccactggga 
ggaaaatgga 
aaataatttt 
caacatagcg 
tgataaccca 
ctgctacatt 
taaccaccac 
tgtgcccttc 
gccgggctca 
ctccctagga 
ttacgccacc 
aaacgacacc 
tatacccgcc 
ccgcggctgg 
cgacccttat 
ccacaccttt 
ccgcctgctt 
cgttgcccag 
cattggctac 
tagaaacttc 
acaggtgggc 
catgcgcgaa 
agttgacagc 
ctccagtaac 
caactccgcc 
tctttatgtt 
catcgaaacc 
caagcaacat 
gtcaaagatc 
tttgtttctc 
ggcgtacact 
ccctttggct 
ctgcgccgta 
caaagcgtac 
gcctttgcca 

ggggtaccca 



gaccgccgtt 
gcgatcgttg 
tctgggggtg 
gtgtcatgta 
tccaagatgg 
gacgcctcgg 
ttcagcctga 
gaccggtccc 
tcgtacaagg 
acgtactttg 
actgcctaca 
gctactgctc 
gagcaagctg 
attacaaagg 
acatttcaac 
gcagctggga 
aaacccacaa 
gaaagtcaag 
aacttgactc 
catatttctt 
tctatgccca 
tacaacagca 
gtagatttgc 
gatagaacca 
agaattattg 
ggtgtgatta 
tgggaaaaag 
gccatggaaa 
ctgtatttgc 
aacacctacg 
aaccttggag 
cgcaatgctg 
cacatccagg 
tacacctacg 
aatgacctaa 
ttcttcccca 
aacgaccagt 
aacgctacca 
gccttcacgc 
tacacctact 
aagaaggtgg 
acccccaacg 
tgtaacatga 
cagggcttct 
cagcccatga 
atcctacacc 
ggacaggcct 
attacccaga 
tttatgtcca 
cacgcgctag 
ttgtttgaag 
gtgtacctgc 
caacaacagc 
ttggttgtgg 
cacacaagct 
ggatggcctt 
tttctgacca 
gcgccattgc 
aggggcccaa 
actggcccca 
actccatgct 



gttgtaaccc 
cggcccgtag 
caatccctga 
tgcgtccatg 
ctaccccttc 
agtacctgag 
ataacaagtt 
agcgtttgac 
cgcggttcac 
acatccgcgg 
acgccctggc 
ttgaaataaa 
agcagcaaaa 
agggtattca 
ctgaacctca 
gagtccttaa 
atgaaaatgg 
tggaaatgca 
ctaaagtggt 
acatgcccac 
acaggcctaa 
cgggtaatat 
aagacagaaa 
ggtacttttc 
aaaatcatgg 
atacagagac 
atgctacaga 
tcaatctaaa 
ccgacaagct 
actacatgaa 
cacgctggtc 
gcctgcgcta 
tgcctcagaa 
agtggaactt 
gggttgacgg 
tggcccacaa 
cctttaacga 
acgtgcccat 
gccttaagac 
ctggctctat 
ccattacctt 
agtttgaaat 
ccaaagactg 
atatcccaga 
gccgtcaggt 
aacacaacaa 
accctgctaa 
aaaagtttct 
tgggcgcact 
acatgacttt 
tctttgacgt 
gcacgccctt 
tgccgccatg 
gccatatttt 
cgcctgcgcc 
tgcctggaac 
gcgactcaag 
ttcttccccc 
ctcggccgcc 
aactcccatg 
caacagtccc 



gtcctagccg 
ccagtggcaa 
agcgccgacg 
tcgccgccag 
gatgatgccg 
ccccgggctg 
tagaaacccc 
gctgcggttc 
cctagctgtg 
cgtgctggac 
tcccaagggt 
cctagaagaa 
aactcacgta 
aataggtgtc 
aataggagaa 
aaagactacc 
agggcaaggc 
atttttctca 
attgtacagt 
tattaaggaa 
ttacattgct 
gggtgttctg 
cacagagctt 
tatgtggaat 
aactgaagat 
tcttaccaag 
attttcagat 
tgccaacctg 
aaagtacagt 
caagcgagtg 
ccttgactat 
ccgctcaatg 
gttctttgcc 
caggaaggat 
agccagcatt 
caccgcctcc 
ctatctctcc 
atccatcccc 
taaggaaacc 
accctaccta 
tgactcttct 
taagcgctca 
gttcctggta 
gagctacaag 
ggtggatgat 
ctctggattt 
cttcccctat 
ttgcgatcgc 
cacagacctg 
tgaggtggat 
ggtccgtgtg 
ctcggccggc 
ggctccagtg 
ttgggcacct 
atagtcaata 
ccgcactcaa 
caggtttacc 
gaccgctgta 
tgtggactat 
gatcacaacc 
caggtacagc 



cgcgtccctg 
ctggcaaagc 
atgcttctga 
aggagctgct 
cagtggtctt 
gtgcagtttg 
acggtggcgc 
atccctgtgg 
ggtgataacc 
aggggcccta 
gccccaaatc 
gaggacgatg 
tttgggcagg 
gaaggtcaaa 
tctcagtggt 
ccaatgaaac 
attcttgtaa 
actactgagg 
gaagatgtag 
ggtaactcac 
tttagggaca 
gcgggccaag 
tcataccagc 
caggctgttg 
gaacttccaa 
gtaaaaccta 
aaaaatgaaa 
tggagaaatt 
ccttccaacg 
gtggctcccg 
atggacaacg 
ttgctgggca 
attaaaaacc 
gtfcaacatgg 
aagtttgata 
acgcttgagg 
gccgccaaca 
tcccgcaact 
ccatcactgg 
gatggaacct 
gtcagctggc 
gttgacgggg 
caaatgctag 
gaccgcatgt 
actaaataca 
gttggctacc 
ccgcttatag 
accctttggc 
ggccaaaacc 
cccatggacg 
caccggccgc 
aacgccacaa 
agcaggaact 
atgacaagcg 
cggccggtcg 
aaacatgcta 
agtttgagta 
taacgctgga 
tctgctgcat 
ccaccatgaa 
ccaccctgcg 



cgccgcgccg 
acactgaaca 
atagctaacg 
gagccgccgc 
acatgcacat 
cccgcgccac 
ctacgcacga 
accgtgagga 
gtgtgctgga 
cttttaagcc 
cttgcgaatg 
acaacgaaga 
cgccttattc 
cacctaaata 
acgaaactga 
catgttacgg 
agcaacaaaa 
cgaccgcagg 
atatagaaac 
gagaactaat 
attttattgg 
catcgcagtt 
ttttgcttga 
acagctatga 
attactgctt 
aaacaggtca 
taagagttgg 
tcctgtactc 
taaaaatttc 
ggttagtgga 
tcaacccatt 
atggtcgcta 
tccttctcct 
ttctgcagag 
gcatttgcct 
ccatgcttag 
tgctctaccc 
gggcggcttt 
gctcgggcta 
tttacctcaa 
ctggcaatga 
agggttacaa 
ctaactacaa 
actccttctt 
aggactacca 
ttgcccccac 
gcaagaccgc 
gcatcccatt 
ttctctacgc 
agcccaccct 
accgcggcgt 
cataaagaag 
gaaagccatt 
ctttccaggc 
cgagactggg 
cctctttgag 
cgagtcactc 
aaagtccacc 
gtttctccac 
ccttattacc 
tcgcaaccag 



19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 



WO 03/062400 



PCT/US03/02295 



-31- 



gaacagctct acagcttcct ggagcgccac tcgccctact tccgcagcca cagtgcgcag 23340 
attaggagcg ccacttcttt ttgtcacttg aaaaacatgt aaaaataatg tactagagac 234 0 0 
actttcaata aaggcaaatg cttttatttg tacactctcg ggtgattatt tacccccacc 23460 
cttgccgtct gcgccgtttg gggaggcggc ggcgacgggg acggggacga cacgtcctcc 23520 
atggttgggg gacgtcgcgc cgcaccgcgt ccgcgctcgg gggtggtttc gcgctgctcc 23580 
tcttcccgac tggccatttc cttctcctat aggcagaaaa agatcatgga gtcagtcgag 23640 
aagaaggaca gcctaaccgc cccctctgag ttcgccacca ccgcctccac cgatgccgcc 2370 0 
aacgcgccta ccaccttccc cgtcgaggca cccccgcttg aggaggagga agtgattatc 23760 
gagcaggacc caggttttgt aagcgaagac gacgaggacc gctcagtacc aacagaggat 23820 
aaaaagcaag accaggacaa cgcagaggca aacgaggaac aagtcgggcg gggggacgaa 23880 
aggcatggcg actacctaga tgtgggagac gacgtgctgt tgaagcatct gcagcgccag 23940 
tgcgccatta tctgcgacgc gttgcaagag cgcagcgatg tgcccctcgc catagcggat 24000 
gtcagccttg cctacgaacg ccacctattc tcaccgcgcg taccccccaa acgccaagaa 24060 
aacggcacat gcgagcccaa cccgcgcctc aacttctacc ccgtatttgc cgtgccagag 2412 0 
gtgcttgcca cctatcacat ctttttccaa aactgcaaga tacccctatc ctgccgtgcc 2418 0 
aaccgcagcc gagcggacaa gcagctggcc ttgcggcagg gcgctgtcat acctgatatc 2424 0 
gcctcgctca acgaagtgcc aaaaatcttt gagggtcttg gacgcgacga gaagcgcgcg 2430 0 
gcaaacgctc tgcaacagga aaacagcgaa aatgaaagtc actctggagt gttggtggaa 24360 
ctcgagggtg acaacgcgcg cctagccgta ctaaaacgca gcatcgaggt cacccacttt 2442 0 
gcctacccgg cacttaacct accccccaag gtcatgagca cagtcatgag tgagctgatc 24480 
gtgcgccgtg cgcagcccct ggagagggat gcaaatttgc aagaacaaac agaggagggc 2454 0 
ctacccgcag ttggcgacga gcagctagcg cgctggcttc aaacgcgcga gcctgccgac 24600 
ttggaggagc gacgcaaact aatgatggcc gcagtgctcg ttaccgtgga gcttgagtgc 24660 
atgcagcggt tctttgctga cccggagatg cagcgcaagc tagaggaaac attgcactac 24720 
acctttcgac agggctacgt acgccaggcc tgcaagatct ccaacgtgga gctctgcaac 24780 
ctggtctcct accttggaat tttgcacgaa aaccgccttg ggcaaaacgt gcttcattcc 24840 
acgctcaagg gcgaggcgcg ccgcgactac gtccgcgact gcgtttactt atttctatgc 24900 
tacacctggc agacggccat gggcgtttgg cagcagtgct tggaggagtg caacctcaag 24960 
gagctgcaga aactgctaaa gcaaaacttg aaggacctat ggacggcctt caacgagcgc 25020 
tccgtggccg cgcacctggc ggacatcatt ttccccgaac gcctgcttaa aaccctgcaa 25080 
cagggtctgc cagacttcac cagtcaaagc atgttgcaga actttaggaa ctttatccta 25140 
gagcgctcag gaatcttgcc cgccacctgc tgtgcacttc ctagcgactt tgtgcccatt 25200 
aagtaccgcg aatgccctcc gccgctttgg ggccactgct accttctgca gctagccaac 25260 
taccttgcct accactctga cataatggaa gacgtgagcg gtgacggtct actggagtgt 25320 
cactgtcgct gcaacctatg caccccgcac cgctccctgg tttgcaattc gcagctgctt 25380 
aacgaaagtc aaattatcgg tacctttgag ctgcagggtc cctcgcctga cgaaaagtcc 25440 
gcggctccgg ggttgaaact cactccgggg ctgtggacgt cggcttacct tcgcaaattt 25500 
gtacctgagg actaccacgc ccacgagatt aggttctacg aagaccaatc ccgcccgcca 25560 
aatgcggagc ttaccgcctg cgtcattacc cagggccaca ttcttggcca attgcaagcc 25620 
atcaacaaag cccgccaaga gtttctgcta cgaaagggac ggggggttta cttggacccc 25680 
cagtccggcg aggagctcaa cccaatcccc ccgccgccgc agccctatca gcagcagccg 25740 
cgggcccttg cttcccagga tggcacccaa aaagaagctg cagctgccgc cgccacccac 25 8 00 
ggacgaggag gaatactggg acagtcaggc agaggaggtt ttggacgagg aggaggagga 25 8 60 
catgatggaa gactgggaga gcctagacga ggaagcttcc gaggtcgaag aggtgtcaga 25920 
cgaaacaccg tcaccctcgg tcgcattccc ctcgccggcg ccccagaaat cggcaaccgg 259 80 
ttccagcatg gctacaacct ccgctcctca ggcgccgccg gcactgcccg ttcgccgacc 26040 
caaccgtaga tgggacacca ctggaaccag ggccggtaag tccaagcagc cgccgccgtt 26100 
agcccaagag caacaacagc gccaaggcta ccgctcatgg cgcgggcaca agaacgccat 26160 
agttgcttgc ttgcaagact gtgggggcaa catctccttc gcccgccgct ttcttctcta 2622 0 
ccatcacggc gtggccttcc cccgtaacat cctgcattac taccgtcatc tctacagccc 26280 
atactgcacc ggcggcagcg gcagcggcag caacagcagc ggccacacag aagcaaaggc 2634 0 
gaccggatag caagactctg acaaagccca agaaatccac agcggcggca gcagcaggag 2640 0 
gaggagcgct gcgtctggcg cccaacgaac ccgtatcgac ccgcgagctt agaaacagga 26460 
tttttcccac tctgtatgct atatttcaac agagcagggg ccaagaacaa gagctgaaaa 26520 
taaaaaacag gtctctgcga tccctcaccc gcagctgcct gtatcacaaa agcgaagatc 26580 
agcttcggcg cacgctggaa gacgcggagg ctctcttcag taaatactgc gcgctgactc 26 64 0 
ttaaggacta gtttcgcgcc ctttctcaaa tttaagcgcg aaaactacgt catctccagc 26700 
ggccacaccc ggcgccagca cctgtcgtca gcgccattat gagcaaggaa attcccacgc 26760 
cctacatgtg gagttaccag ccacaaatgg gacttgcggc tggagctgcc caagactact 2 6 820 
caacccgaat aaactacatg agcgcgggac cccacatgat atcccgggtc aacggaatcc 2 68 80 
gcgcccaccg aaaccgaatt ctcttggaac aggcggctat taccaccaca cctcgtaata 26940 
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accttaatcc 
ctgtggtact 
ttgcgggcgg 
tcagagggcg 
cggacgggac 
tcctaactct 
ttattgagga 
atccggatca 
gaatgttaag 
acaagtgctt 
tcgagggccc 
ttcgggagtt 
ctgtgatttg 
ctgagtataa 
gccaccgtct 
tctccctctg 
tccgagctca 
agtgcgtcac 
acagacctca 
ttaggccaaa 
ttctaattca 
gacgcagggc 
tgcaccagtg 
gtaataccac 
tcatggtggg 
ttcactcacc 
tcaaagatct 
tcagttagca 
tggtattgca 
tcctcctgtt 
agaccgtctg 
actgtgcctt 

ggggtactct 

aaaatgggca 
actgtgagcc 
ctcacagtta 
aacacactca 
gccacccaag 
ctcaccacca 
actggtagct 
ctaaagtacg 
ggtccaggtg 
tttgattcac 
aacagacgcc 
agactaggac 
aaaggccttt 
actgccaagg 
gaatttggtt 
ctagaatttg 
agcacaggtg 
ccagctccat 
ttaacaaaat 
ttggctccaa 
ggagtgctac 
cttactgaag 
ccaaaatctc 
gacaaaacta 
acaactccaa 
aatgaaatat 
gtttgtgtta 
attcagtagt 



ccgtagttgg 
tcccagagac 
ctttcgtcac 
aggtattcag 
atttcagatc 
gcagacctcg 
gtttgtgcca 
atttattcct 
tggagaggca 
tgcccgcgac 
ggcgcacggc 
tacccagcgc 
caactgtcct 
taaatacaga 
tcacccgccc 
tgatttacaa 
gctactccat 
cggccgctgc 
ataactctgt 
ggcgcagcta 
ggtttctcta 
agcggccgag 
caaaaggggt 
cggacaccgc 
agaaaagccc 
ttgtcaagga 
tattcccttt 
aatttctgtc 
gcttcctcct 
cctgtccatc 
aagatacctt 
ttcttactcc 
ctttgcgcct 
acggcctctc 
cacctctcaa 
cctcagaagc 
ccatgcaatc 
gacccctcac 
ccgatagcag 
tgggcattga 
gggctccttt 
tgactattaa 
aaggcaatat 
ttatacttga 
agggccctct 
acttgtttac 
ggttgatgtt 
cacctaatgc 
attcaaacaa 
ccattacagt 
ctcctaactg 
gtggcagtca 
tatctggaac 
taaacaattc 
gcacagccta 
acggtaaaac 
aacctgtaac 
gtgcatactc 
ttgccacatc 
tgtttcaacg 
atagccccac 



cccgctgccc 
gcccaggccg 
agggtgcggt 
ctcaacgacg 
ggcggcgccg 
tcctctgagc 
tcggtctact 
aactttgacg 
gagcaactgc 
tccggtgagt 
gtccggctta 
cccctgctag 
aaccttggat 
aattaaaata 
aagcaaacca 
cagtttcaac 
cagaaaaaac 
accacaccta 
ttaccagaac 
ctgtggggtt 
gaaatggacg 
caacagcgca 
atcttttgtc 
cttagctaca 
attaccataa 
cctgaggatc 
aactaataaa 
cagtttattc 
ggctgcaaac 
cgcacccact 
caaccccgtg 
tccctttgta 
atccgaacct 
tctggacgag 
aaaaaccaag 
cctaactgtg 
acaggccccg 
agtgtcagaa 
tacccttact 
cttgaaagag 
gcatgtaaca 
taatacttcc 
gcaacttaat 
tgttagttat 
ttttataaac 
agcttcaaac 
tgacgctaca 
accaaacaca 
ggctatggtt 
aggaaacaaa 
tagactaaat 
aatacttgct 
agttcaaagt 
cttcctggac 
tacaaacgct 
tgccaaaagt 
actaaccatt 
tatgtcattt 
ctcttacact 
tgtttatttt 
caccacatag 



tggtgtacca 
aagttcagat 
cgcccgggca 
agtcggtgag 
gccgtccttc 
cgcgctctgg 
ttaacccctt 
cggtaaagga 
gcctgaaaca 
tttgctactt 
ccgcccaggg 
ttgagcggga 
tacatcaaga 
tactggggct 

aggcgaacct 

ccagacggag 
accaccctcc 
ccgcctgacc 
aggaggtgag 
tatgaacaat 
gaattattac 
tgaatcaaga 
tggtaaagca 
agttgccaac 
ctcagcactc 
tctgcaccct 
aaaaaataat 
agcagcacct 
tttctccaca 
atcttcatgt 
tatccatatg 
tcccccaatg 
ctagttacct 
gccggcaacc 
tcaaacataa 
gctgccgccg 
ctaaccgtgc 
ggaaagctag 
atcactgcct 
cccatttata 
gacgacctaa 
ttgcaaacta 
gtagcaggag 
ccgtttgatg 
tcagcccaca 
aattccaaaa 
gccatagcca 
aatcccctca 
cctaaactag 
aataatgata 
gcagagaaag 
acagfcttcag 
gctcatctta 
ccagaatatt 
gttggattta 
aacattgtca 
acactaaacg 
tcatgggact 
ttttcataca 
tcaattgcag 
cttatacaga 



ggaaagtccc 
gactaactca 
gggtataact 
ctcctcgctt 
attcacgcct 
aggcattgga 
ctcgggacct 
ctcggcggac 
cctggtccac 
tgaattgccc 
agagcttgcc 
caggggaccc 
tctttgttgc 
cctatcgcca 
tacctggtac 
tgagtctacg 
ttacctgccg 
gtaaaccaga 
cttagaaaac 
tcaagcaact 
agagcagcgc 
gctccaagac 
ggccaaagtc 
caagcgtcag 
ggtagaaacc 
tattaagacc 
aaagcatcac 
ccttgccctc 
atctaaatgg 
tgttgcagat 
acacggaaac 
ggtttcaaga 
ccaatggcat 
ttacctccca 
acctggaaat 
cacctctaat 
acgactccaa 
ccctgcaaac 
caccccctct 
cacaaaatgg 
acactttgac 
aagttactgg 
gactaaggat 
ctcaaaacca 
acttggatat 
agcttgaggt 
ttaatgcagg 
aaacaaaaat 
gaactggcct 
agctaacttt 
atgctaaact 
ttttggctgt 
ttataagatt 
ggaactttag 
tgcctaacct 
gtcaagttta 
gtacacagga 
ggtctggcca 
ttgcccaaga 
aaaatttcaa 
tcaccgtacc 



gctcccacca 
ggggcgcagc 
cacctgacaa 
ggtctccgtc 
cgtcaggcaa 
actctgcaat 
cccggccact 
ggctacgact 
tgtcgccgcc 
gaggatcata 
cgtagcctga 
tgtgttctca 
catctctgtg 
tcctgtaaac 
ttttaacatc 
agagaacctc 
ggaacgtacg 
ctttttccgg 
ccttagggta 
ctacgggcta 
ctgctagaaa 
atggttaact 
acctacgaca 
aaattggtgg 
gaaggctgca 
ctgtgcggtc 
ttacttaaaa 
ctcccagctc 
aatgtcagtt 
gaagcgcgca 
cggtcctcca 
gagtccccct 
gcttgcgctc 
aaatgtaacc 
atctgcaccc 
ggtcgcgggc 
acttagcatt 
atcaggcccc 
aactactgcc 
aaaactagga 
cgtagcaact 
agccttgggt 
tgattctcaa 
actaaatcta 
taactacaac 
taacctaagc 
agatgggctt 
tggccatggc 
tagttttgac 
gtggaccaca 
cactttggtc 
taaaggcagt 
tgacgaaaat 
aaatggagat 
atcagcttat 
cttaaacgga 
aacaggagac 
caactacatt 
ataaagaatc 
gtcatttttc 
ttaatcaaac 
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tcacagaacc ctagtattca acctgccacc 
ttctccccgg ctggccttaa aaagcatcat 
tatattccac acggtttcct gtcgagccaa 
gggcagctca cttaagttca tgtcgctgtc 
ttgcggttgc ttaacgggcg gcgaaggaga 
atcgtgcatc aggatagggc ggtggtgctg 
ccgctccgtc ctgcaggaat acaacatggc 
ccgcagcata aggcgccttg tcctccgggc 
agcacagtaa ctgcagcaca gcaccacaat 
gtatccaaag ctcatggcgg ggaccacaga 
gtagattaag tggcgacccc tcataaacac 
gttgtaattc accacctccc ggtaccatat 
caccatccta aaccagctgg ccaaaacctg 
actggaacaa tgacagtgga gagcccagga 
gatatcaatg ttggcacaac acaggcacac 
ctcccgcgtt agaaccatat cccagggaac 
actgcaggga agacctcgca cgtaactcac 
cagcagcgga tgatcctcca gtatggtagc 
atccctactg tacggagtgc gccgagacaa 
aaatggaacg ccggacgtag tcatatttcc 
cagatctgcg tctccggtct cgccgcttag 
actctctcaa agcatccagg cgccccctgg 
ccgctgccct gataacatcc accaccgcag 
cgttctgcga gtcacacacg ggaggagcgg 
tattccaaaa gattatccaa aacctcaaaa 
ccggtggcgt ggtcaaactc tacagccaaa 
acaatggctt ccaaaaggca aacggccctc 
tcagggtgaa tctcctctat aaacattcca 
tctcgccacc ttctcaatat atctctaagc 
aaaatctgct ccagagcgcc ctccaccttc 
attcaggttc ctcacagacc tgtataagat 
gatcccgtag gtcccttcgc agggccagct 
gcgcggccac ttccccgcca ggaaccttga 
tactcggagc tatgctaacc agcgtagccc 
ataaaatgca aggtgctgct caaaaaatca 
tcgtagtcat gctcatgcag ataaaggcag 
accatttttc tctcaaacat gtctgcgggt 
aaaacattta aacattagaa gcctgtctta 
gacggactac ggccatgccg gcgtgaccgt 
ccaccgacag ctcctcggtc atgtccggag 
gttgattcat cggtcagtgc taaaaagcga 
aggcgtagag acaacattac agcccccata 
aacacataaa cacctgaaaa accctcctgc 
acaacataca gcgcttcaca gcggcagcct 
cctattaaaa aaacaccact cgacacggca 
gccaagtgca gagcgagtat atataggact 
aaaacaccca gaaaaccgca cgcgaaccta 
acttcctcaa atcgtcactt ccgttttccc 
tacaattccc aacacataca agttactccg 
cacgccccgc gccacgtcac aaactccacc 
ataaggtata ttattgatga tg 

<210> 45 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 
<400> 45 



-33- 



tccctcccaa cacacagagt acacagtcct 30660 
atcatgggta acagacatat tcttaggtgt 3072 0 
acgctcatca gtgatattaa taaactcccc 30780 
cagctgctga gccacaggct gctgtccaac 30840 
agtccacgcc tacatggggg tagagtcata 3 0900 
cagcagcgcg cgaataaact gctgccgccg 3 0 960 
agtggtctcc tcagcgatga ttcgcaccgc 31020 
acagcagcgc accctgatct cacttaaatc 31080 
attgttcaaa atcccacagt gcaaggcgct 31140 
acccacgtgg ccatcatacc acaagcgcag 31200 
gctggacata aacattacct cttttggcat 31260 
aaacctctga ttaaacatgg cgccatccac 31320 
cccgccggct atacactgca gggaaccggg 313 80 
ctcgtaacca tggatcatca tgctcgtcat 31440 
gtgcatacac ttcctcagga ttacaagctc 31500 
aacccattcc tgaatcagcg taaatcccac 31560 
gttgtgcatt gtcaaagtgt tacattcggg 31620 
gcgggtttct gtctcaaaag gaggtagacg 31680 
ccgagatcgt gttggtcgta gtgtcatgcc 3174 0 
tgaagcaaaa ccaggtgcgg gcgtgacaaa 31800 
atcgctctgt gtagtagttg tagtatatcc 31860 
cttcgggttc tatgtaaact ccttcatgcg 31920 
aataagccac acccagccaa cctacacatt 31980 
gaagagctgg aagaaccatg tttttttttt 32 040 
tgaagatcta ttaagtgaac gcgctcccct 32100 
gaacagataa tggcatttgt aagatgttgc 32160 
acgtccaagt ggacgtaaag gctaaaccct 32220 
gcaccttcaa ccatgcccaa ataattctca 32280 
aaatcccgaa tattaagtcc ggccattgta 32340 
agcctcaagc agcgaatcat gattgcaaaa 32400 
tcaaaagcgg aacattaaca aaaataccgc 3 2460 
gaacataatc gtgcaggtct gcacggacca 32 52 0 
caaaagaacc cacactgatt atgacacgca 32 58 0 
cgatgtaagc tttgttgcat gggcggcgat 32640 
ggcaaagcct cgcgcaaaaa agaaagcaca 3270 0 
gtaagctccg gaaccaccac agaaaaagac 32760 
ttctgcataa acacaaaata aaataacaaa 32 82 0 
caacaggaaa aacaaccctt ataagcataa 32 88 0 
aaaaaaactg gtcaccgtga ttaaaaagca 32 940 
tcataatgta agactcggta aacacatcag 33000 
ccgaaatagc ccgggggaat acatacccgc 33 06 0 
ggaggtataa caaaattaat aggagagaaa 3312 0 
ctaggcaaaa tagcaccctc ccgctccaga 33180 
aacagtcagc cttaccagta aaaaagaaaa 3324 0 
ccagctcaat cagtcacagt gtaaaaaagg 33300 
aaaaaatgac gtaacggtta aagtccacaa 33360 
cgcccagaaa cgaaagccaa aaaacccaca 33420 
acgttacgta acttcccatt ttaagaaaac 33480 
ccctaaaacc tacgtcaccc gccccgttcc 33540 
ccctcattat catattggct tcaatccaaa 33600 

33622 



WO 03/062400 
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atgaagcgcg 
accggtcctc 
gagagtcccc 
atgcttgcgc 
caaaatgtaa 
atatctgcac 
atggtcgcgg 
aaacttagca 
acatcaggcc 
ctaactactg 
ggaaaactag 
accgtagcaa 
ggagccttgg 
attgattctc 
caactaaatc 
attaactaca 
gttaacctaa 
ggagatgggc 
attggccatg 
cttagttttg 
ttgtggacca 
ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



-34- 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<210> 46 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 



<400> 46 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 






5 










10 










15 




Tyr 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 




20 








25 










30 






Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser 


Pro 


Pro 


Gly 


Val 


Leu 


Ser 






35 








*0 










45 








Leu 


Arg 
50 


Leu 


Ser 


Glu 


Pro 


Leu 
55 


Val 


Thr 


Ser 


Asn 


Gly 
60 


Met 


Leu 


Ala 


Leu 


Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala 


Gly 


Asn 


Leu 


Thr 


Ser 


65 








70 










75 










80 


Gin 


Asn 


Val 


Thr 


Thr 
85 


Val 


Ser 


Pro 


Pro 


Leu 
90 


Lys 


Lys 


Thr 


Lys 


Ser 
95 


Asn 


lie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val 


Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 
115 


Ala 


Ala 


Ala 


Pro 


Leu 
120 


Met 


Val 


Ala 


Gly 


Asn 
125 


Thr 


Leu 


Thr 


Met 


Gin 
130 


Ser 


Gin 


Ala 


Pro 


Leu 
135 


Thr 


Val 


His 


Asp 


Ser 
14 0 


Lys 


Leu 


Ser 


He 


Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 


145 










150 










155 










160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 
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165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

180 185 190 

Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 ~ 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 " 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 ~ 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 

530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 
580 



<210> 47 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 



inr.irv <-wo n-w^anna? t ■> 
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<223> 5F KOIRGD 
<400> 47 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact- cctccctttg tatcccccaa tgggtttcaa 12 0 

gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 

atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 

caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 300 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 42 0 

aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 480 

acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 54 0 

ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 

ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 66 0 

accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 72 0 

ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 

attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 84 0 

caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 90 0 

attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 

gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 102 0 

ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 10 80 

attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 114 0 

cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 12 00 

ttgtggacca caccagctcc atctcctaac tgtagactaa atgcagagaa agatgctaaa 1260 

ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 132 0 

gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 13 80 

tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 1440 

agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 

ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 

tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 162 0 

gaaacaggtg atcattgtga ttgtcgtggt gattgttttt gtacaactcc aagtgcatac 1680 

tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 174 0 

tcctcttaca ctttttcata cattgcccaa gaataa 1776 

<210> 48 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<223> 5F KOIRGD 



<400> 48 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 








5 










10 










15 




Tyr 


Asp 


Thr 


Glu 
20 


Thr 


Gly 


Pro 


Pro 


Thr 

25 


Val 


Pro 


Phe 


Leu 


Thr 
30 


Pro 


Pro 


Phe 


Val 


Ser 
35 


Pro 


Asn 


Gly 


Phe 


Gin 
40 


Glu 


Ser 


Pro 


Pro 


Gly 
45 


Val 


Leu 


Ser 


Leu 


Arg 
50 


Leu 


Ser 


Glu 


Pro 


Leu 
55 


Val 


Thr 


Ser 


Asn 


Gly 
60 


Met 


Leu 


Ala 


Leu 


Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala 


Gly 


Asn 


Leu 


Thr 


Ser 


65 










70 










75 










80 


Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 










85 










90 






95 




lie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val 


Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly 


Asn 


Thr 


Leu 


Thr 






115 










120 








125 








Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 



130 135 140 
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Ala 


Thr Gin Gly 


Pro 


Leu 


Thr 


Val 


Ser Glu 


Gly 


Lys 


Leu Ala 


Leu 


Gin 


145 










150 








155 








160 


Thr 


Ser Gly 


Pro 


Leu 


Thr 


Thr 


Thr Asp Ser 


Ser 


Thr 


Leu Thr 


lie 


Thr 










165 








170 








175 




Ala 


Ser 


Pro 


Pro 
180 


Leu 


Thr 


Thr 


Ala 


Thr Gly 
185 


Ser 


Leu 


Gly He 
190 


Asp 


Leu 


Lys 


Glu 


Pro 


He 


Tyr Thr Gin Asn Gly Lys 


Leu 


Gly 


Leu Lys 


Tyr Gly 




195 










200 








205 






Ala 


Pro 
210 


Leu 


His 


Val 


Thr 


Asp 
215 


Asp 


Leu Asn 


Thr 


Leu 
220 


Thr Val 


Ala 


Thr 


Gly 


Pro Gly Val 


Thr 


He 


Asn 


Asn 


Thr Ser 


Leu 


Gin 


Thr Lys 


Val 


Thr 












230 








235 








240 


Gly 


Ala Leu Gly Phe Asp 


Ser Gin Gly Asn Met 


Gin 


Leu Asn 


Val 


Ala 








245 








250 








255 




Gly 


Gly Leu Arg 


He 


Asp 


Ser 


Gin 


Asn Arg 


Arg 


Leu 


He Leu 


Asp 


Val 






260 










265 






270 








Tyr 


Pro 


Phe Asp Ala Gin Asn Gin Leu Asn 


Leu Arg Leu Gly Gin 




275 










280 








285 






Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His Asn 


Leu 


Asp 


He Asn 


Tyr Asn 


290 










295 








300 








Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Ser Asn 


Asn 


Ser 


Lys Lys 


Leu 


Glu 


one 






310 








315 








320 


Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly Leu Met 


Phe 


Asp Ala Thr 


Ala 


He 










325 








330 








335 




.Ala 


He 


Asn 


Ala 


Gly Asp 


Gly 


Leu 


Glu Phe 


Gly 


Ser 


Pro Asn 


Ala 


Pro 








340 








345 






350 






Ren 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He Gly His 


Gly Leu Glu 


Phe Asp 






355 










360 








365 






Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu Gly Thr 


Gly 


Leu Ser 


Phe 


Asp 




370 








375 








380 








■ Ser 


Thr Gly Ala 


He 


Thr 


Val 


Gly Asn Lys 


Asn 


Asn Asp Lys 


Leu 


Thr 


• -a q c 










390 








395 








400 


• Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 


Ala Asn 


Cys 


Arg 


Leu Asn 


Ala 


Glu 








405 








410 








415 




Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu Thr 


Lys 


Cys 


Gly Ser Gin 


He 




420 










425 






430 






j,pi) 


Ala 


Thr 
435 


Val 


Ser 


Val 


Leu 


Ala 
440 


Val Lys 


Gly 


Ser 


Leu Ala 
445 


Pro 


He 


OCJL 


Gly 
450 


Thr 


Val 


Gin 


Ser 


Ala 
455 


His 


Leu He 


He 


Arg 
460 


Phe Asp 


Glu 


Asn 


Glv 

J; 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu Asp 


Pro 


Glu 


Tyr Trp 


Asn 


Phe 


465 










470 








475 








480 


Arg 


Asn 


Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr Ala 


Tyr 


Thr 


Asn Ala 


Val 


Gly 








485 








490 








495 




Phe 


Met 


Pro 


Asn 
500 


Leu 


Ser 


Ala 


Tyr 


Pro Lys 
505 


Ser 


His 


Gly Lys 
510 


Thr 


Ala 


Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr Leu 


Asn 


Gly Asp Lys 


Thr 


Lys 




515 










520 








525 






Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn Gly 


Thr 


Gin 


Glu Thr 


Gly Asp 




530 










535 








540 








His 


Cys 


Asp 


Cys 


Arg 


Gly 


Asp 


Cys 


Phe Cys 


Thr 


Thr 


Pro Ser 


Ala 


Tyr 


545 










550 








555 








560 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser Gly His 


Asn 


Tyr He 


Asn 


Glu 










565 






570 








575 




He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe Ser 


Tyr 


He 


Ala Gin 


Glu 





580 585 590 



<210> 49 
<211> 1746 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> 5F K012 



<400> 49 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggag 

cacaactaca 

gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 

cggggctcct 

tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgttcactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atggaggcgg 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



<210> 50 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K012 
<400> 50 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 








5 










10 










15 




Tyr Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


Phe 






20 










25 










30 






Val 


Ser 
35 


Pro 


Asn 


Gly 


Phe 


Gin 
40 


Glu" 


Ser 


Pro 


Pro 


Gly 
45 


Val 


Leu 


Ser 


Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser 


Asn Gly 


Met 


Leu 


Ala 


Leu 




50 










55 










60 










Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala Gly 


Asn 


Leu 


Thr 


Ser 


65 










70 










75 










80 


Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 










85 










90 




95 




lie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val 


Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly 


Asn 


Thr 


Leu 


Thr 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 
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115 120 



Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 




130 










135 






Ala 


Thr 


mi n 


ml \7 

uxy 


XT X LJ 


Leu 


xxxx 


Val 

v dX 


Q (=J 

OCX 


145 








iJU 








Thr 


Ser 


vjxy 


Pro 


Leu 


Thr 


Thr 


Thr 












165 










Al a 


Q (=i 
OCX. 


XT X w 


IT x lj 


T.aii 
XjC LX 


X11X 


xxxx 


AT a 


Tlrr 

X ilX 








180 










185 


xj_y o 


ml ii 


rxu 


xxc 


xyr 


X liX 


v3Xil 


A Cn 


f?l V 

uiy 






X J ? «J 










POD 
^ \j kj 




Ala 


AT X LJ 


XJCU 


xx x o 


Va 1 
vox 


XXIX 


A cn 


A cn 
rib Lj> 


XJC LX 




210 










215 




Glv 


Pro 


Gly 


Val 


Thr 


He 

X 1C 


Asn 


Asn 


Thr 


225 










230 








Gly 


JTiX ct 


UC Li. 


ml \r 
oxy 


xriic; 


Z\ en 


OCX 


Ulil 


mi v 

oxy 










*± _> 










ox y 


\j x y 


XjC lx 


Arg 


liu 


ASp 


OCX 


vjjXIX 


A cn 








O O 










265 


O >■» 


xyr 


XJ>~r^ 


XT IlC 


ASp 


nla 


ml n 


Asn 


al rt 






.^S / O 














mi v 


JTX i_j 


Leu 


±r lit: 


11C 


Asn 


Q q I— 
OCX 


AT a 


XXX s 




290 










OQC 






f Lys 




LeU 


Tyr 


Leu 


rile 


Thr 


an a 
a j. a 


Ser 


one 










j 1U 








Veil 


Asn 


L6U 


Ser 


r T*Vi'V 

inr 


Axa 


Lys 


«xy 


Leu 










^ o e 

^ 3 










. r\J- a. 


Tl <a 

lie 


Asn 


Axa 


biy 


Asp 


biy 


Leu 


ulU 








i a n 














liXX 


Asn 


Pro 


Leu 


Lys 


xxlx 


Lys 


Tl o 

lie 






355 








-J O U 




Q -»—• 


Asn 


Lys 


AJL a. 


M<a+- 


vdl 


Pro 


Lys 


Leu 




7 7 n 

-J / U 










.3 / D 




Ser 


Thr 


mi iy 

uxy 


Al a 
rila 


Tip 

lie 


X tlx 


Val 


vjly 


Asn 


385 
















Leu 


arp 


X XXX 


Tiir 


Pro 


Hlo 


Pro 


Ser 


Pro 










a n c 

f± U D 










oix y 


Asp 


Al a 

riXd 


Lys 


Leu 


X ilx 


Leu 


Val 


Leu 








*? n 










4 Z D 


L6U 


AT a 


X XlX 


vai 


Ser 


Val 

vax 


Leu 


Zil a 

Aia 


v ai 






435 










a a n 






mi \7 


X XXX 


v d.X 


mi n 

L3X11 


ser 


a 

HI ct 


ill & 


_ 

Leu 




450 










455 






Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


465 










470 








Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly 


Thr 










485 










Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 








500 










505 


Lys 


Ser 


Asn 


lie 


Val 


Ser 


Gin 


Val 


Tyr 






515 










520 




Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn 




530 










535 






Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser 


545 










550 








His 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 








565 










Tyr 


lie 


Ala 


Gin 


Glu 











580 
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"IOC 

1Z J 








nib 


A cr> 


OCl x>y s 


XjCU. 


OCX 


Tip 

XXC 






1 Afi 








al ii 

OX LX 


Gly 


xj_y o xic lx 


Ala 

nxa 


XJC LX 


Gin 




ICC 
133 








1DU 


Cor 
OCX 


OCX 


XXXX XJCLX 


X 111 


Tip 
XXC 


X XXX 


1 / u 








I/O 




al \r 


Ser 


itcu uiy 


Tl <=» 

11C 


Asp 


XJCLX 








i o n 






Jjy s 


Leu 


uiy xicu 


Lys 


THr-r~ 

±yr 


dl \r 
oxy 






one 








Asn 


Txir 


jueu i iTx 


Val 


a 1 = 
Aia 


J. Ilx 














O Ci T~ 

OCX 


Leu 


ml n TVi y* 

V7XXX X 1XX 


Lys 


Val 

V Cll 


X XXI 




^ O O 










Asn 


» V 1C L. 


fil n T.oi i 


Asn 


Val 

V Ct X 


Al a 
nld 










o c c 
Job 




Arg 


Arg 


XJCLX XXC 


Leu 


Asp 


Val 
v dX. 








o *7 n 






Leu 


Asn 


ucu fii y 


Leu 


ml \r 
oxy 


ai n 






O D C 

Job 








Asn 


Leu 


Asp He 


Asn 


xyr 


Asn 






O f\ f\ 








Asn 


7\ cn 


Ser Ly s 


Lys 


Leu 


ml 11 

OX LX 












320 


rlcL 


ir XXC 


HSp a1 d 


X ill 


Al a 


Tip 

lie 


330 








o o c 
33 b 




irne 


t3±y 


Ser Pro 


Asn 


A±a 


Pro 








350 








Hi s 


Gly Leu 


f*l n 
bill 


irne 


Asp 






o c c 
job 








biy 


xnr 


Gly Leu 


Ser 


rxie 


Asp 






n o r\ 
3 80 








Lys 


Asn 




Lys 


Leu 


X 1X1 




*3 Q C 








4UU 


Asn 


Cys 


Ser Leu 


Asn 


uiy 


mi 
oiy 


/inn 
410 








415 




Thr 


Lys 


LyS biy 


Ser 


oin 


Tl <=» 
1 1c 








430 






Lys 


vj±y 


T 

Ser Leu 


ax a 


Pro 


Tip 






445 








xxe 


x±e 


Arg Phe 


Asp 


mi ii 

brlU 


Asn 






460 








Asp 


Pro 


Glu Tyr 


Trp 


Asn 


Phe 




475 








480 


Ala 


Tyr 


Thr Asn 


Ala 


Val 


Gly 


490 








495 




Lys 


Ser 


His Gly 


Lys 


Thr 


Ala 








510 






Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 






525 








Gly 


Thr 


Gin Glu 


Thr 


Gly 


Asp 






540 








Phe 


Ser 


Trp Asp 


Trp 


Ser 


Gly 




555 








560 


Thr 


Ser 


Ser Tyr 


Thr 


Phe 


Ser 


570 






575 
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<210> 51 
<211> 1746 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F S* 



<400> 51 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggag 

cacaactaca 

gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



<210> 52 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S* 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
ggagccggag 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
cctcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<400> 52 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

20 25 30 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 
85 90 95 
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Ile Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 110 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 " 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

180 185 190 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 

260 * 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 " 280 285 

• Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 ~ 310 315 320 

. Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 ~ ~ 33 0 335 

7- Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 350 

-Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370. " 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 

530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 



v-Y^irv ^\Air> n*a r^eo a r\r\ a o i ^ 
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580 



<210> 53 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 
<400> 53 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 

accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 

gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 

atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 24 0 

caaaatgtaa ccactgtgag cccacctctc ggagccggag cctcaaacat aaacctggaa 300 

atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 

atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 42 0 

aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 4 80 

acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 

ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 

ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 

accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 720 

ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 

attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 84 0 

caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 

attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 

gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 

ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 10 80 

attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 

cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 12 00 

ttgtggacca caccagctcc atctcctaac tgtagactaa atgcagagaa agatgctaaa 1260 

ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 1320 

gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 1380 

tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 144 0 

agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 

ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 

tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 1620 

gaaacaggtg atcattgtga ttgtcgtggt gattgttttt gtacaactcc aagtgcatac 1680 

tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 174 0 

tcctcttaca ctttttcata cattgcccaa gaataa 1776 

<210> 54 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 
<400> 54 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

20 25 30 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
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65 








Gin 


Asn 


Val 


Thr 


lie 


Asn 


Leu 


Glu 








100 


Thr 


Val 


Ala 


Ala 






115 




Met 


Gin 


Ser 


Gin 




130 






Ala 


Thr 


Gin 


Gly 


145 








Thr 


Ser 


Gly 


Pro 


Ala 


Ser 


Pro 


Pro 








180 


Lys 


Glu 


Pro 


lie 






195 




Ala 


Pro 


Leu 


His 




210 






Gly 


Pro 


Gly 


Val 


225 








Gly Ala 


Leu 


Gly 


Gly Gly 


Leu 


Arg 








260 


Ser 


Tyr 


Pro 


Phe 




275 




Gly Pro 


Leu 


Phe 




290 






Lys Gly 


Leu 


Tyr 


305 








Val 


Asn 


Leu 


Ser 


Ala 


lie 


Asn 


Ala 








340 


Asn 


Thr 


Asn 


Pro 






355 




Ser 


Asn 


Lys 


Ala 




370 






Ser 


Thr 


Gly 


Ala 


385 








Leu 


Trp 


Thr 


Thr 


Lys 


Asp 


Ala 


Lys 








420 


Leu 


Ala 


Thr 


Val 






435 




Ser Gly 


Thr 


Val 




450 






Gly Val 


Leu 


Leu 


465 








Arg Asn 


Gly 


Asp 


Phe 


Met 


Pro 


Asn 








500 


Lys 


Ser 


Asn 


He 




515 




Pro 


Val 


Thr 


Leu 




530 






His 


Cys 


Asp 


Cys 


545 









70 

Thr Val Ser Pro 
85 

He Ser Ala Pro 

Ala Ala Pro Leu 
12 0 

Ala Pro Leu Thr 
135 

Pro Leu Thr Val 
150 

Leu Thr Thr Thr 
165 

Leu Thr Thr Ala 

Tyr Thr Gin Asn 
200 

Val Thr Asp Asp 
215 

Thr He Asn Asn 
230 

Phe Asp Ser Gin 
245 

He Asp Ser Gin 

Asp Ala Gin Asn 
280 

He Asn Ser Ala 
295 

Leu Phe Thr Ala 
310 

Thr Ala Lys Gly 
325 

Gly Asp Gly Leu 

Leu Lys Thr Lys 
360 

Met Val Pro Lys 
375 

He Thr Val Gly 
390 

Pro Ala Pro Ser 
405 

Leu Thr Leu Val 

Ser Val Leu Ala 
440 

Gin Ser Ala His 
455 

Asn Asn Ser Phe 
470 

Leu Thr Glu Gly 
485 

Leu Ser Ala Tyr 

Val Ser Gin Val 
520 

Thr He Thr Leu 
535 

Arg Gly Asp Cys 
550 



75 

Pro Leu Gly Ala 
90 

Leu Thr Val Thr 
105 

Met Val Ala Gly 

Val His Asp Ser 
140 

Ser Glu Gly Lys 
155 

Asp Ser Ser Thr 
170 

Thr Gly Ser Leu 
185 

Gly Lys Leu Gly 

Leu Asn Thr Leu 
220 

Thr Ser Leu Gin 

235 

Gly Asn Met Gin 
250 

Asn Arg Arg Leu 
265 

Gin Leu Asn Leu 

His Asn Leu Asp 
300 

Ser Asn Asn Ser 
315 

Leu Met Phe Asp 
330 

Glu Phe Gly Ser 
345 

He Gly His Gly 

Leu Gly Thr Gly 
380 

Asn Lys Asn Asn 
395 

Pro Asn Cys Arg 
410 

Leu Thr Lys Cys 
425 

Val Lys Gly Ser 

Leu He He Arg 
460 

Leu Asp Pro Glu 
475 

Thr Ala Tyr Thr 
490 

Pro Lys Ser His 
505 

Tyr Leu Asn Gly 

Asn Gly Thr Gin 
540 

Phe Cys Thr Thr 
555 



80 

Gly Ala Ser Asn 
95 

Ser Glu Ala Leu 
110 

Asn Thr Leu Thr 
125 

Lys Leu Ser He 

Leu Ala Leu Gin 
160 

Leu Thr He Thr 
175 

Gly He Asp Leu 
190 

Leu Lys Tyr Gly 
205 

Thr Val Ala Thr 

Thr Lys Val Thr 
240 

Leu Asn Val Ala 
255 

He Leu Asp Val 
270 

Arg Leu Gly Gin 
285 

He Asn Tyr Asn 

Lys Lys Leu Glu 
320 

Ala Thr Ala He 
335 

Pro Asn Ala Pro 
350 

Leu Glu Phe Asp 
365 

Leu Ser Phe Asp 

Asp Lys Leu Thr 
400 

Leu Asn Ala Glu 
415 

Gly Ser Gin He 
430 

Leu Ala Pro He 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Pro Ser Ala Tyr 
560 
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Ser Met Ser Phe 

lie Phe Ala Thr 
580 



Ser Trp Asp Trp 
565 

Ser Ser Tyr Thr 



Ser Gly His Asn 
570 

Phe Ser Tyr lie 
585 



Tyr lie Asn Glu 
575 

Ala Gin Glu 
590 



<210> 55 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 



<400> 55 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggag 

cacaactaca 

gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
ggagccggag 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
cctcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<210> 56 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 
<400> 56 

Met Lys Arg Ala- Arg Pro Ser Glu Asp Thr Phe 

1 5 10 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro 

20 25 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro 
35 40 



Ash Pro Val Tyr Pro 
15 

Phe Leu Thr Pro Pro 
30 

Pro Gly Val Leu Ser 
45 
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Leu 


Arg 
50 


Leu 


Ser 


Glu 


Pro 


Leu 
55 


Val 


Thr 


Ser 


Asn 


Gly Met 
60 


Leu 




Leu 


Lys 


Met 


Gly Asn Gly 


Leu 


Ser 


Leu Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr* 




65 










70 










75 








q n 
o u 


bin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Gly Ala Gly Ala 


Ser 


Asn 










85 










90 








95 




xie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val 


Thr Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly Asn Thr 


Leu 


Thr 






115 










120 








125 








Met 


Gin 
130 


Ser 


Gin 


Ala 


Pro 


Leu 
135 


Thr 


Val 


His 


Asp 


Ser Lys 
140 


Leu 


Ser 


He 


Ala 


Thr Gin Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu Gly Lys Leu 


Ala 


Leu 


Gin 


145 










15 0 










155 








160 


Thr 


Ser Gly 


Pro 


Leu Thr Thr Thr Asp 


Ser 


Ser 


Thr Leu 


Thr 


He 


Thr 










165 










170 








175 




Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 


Thr 


Gly Ser Leu Gly 


He 


Asp 


Leu 








180 










185 








190 






Lys 


Glu 


Pro 


He 


Tyr Thr Gin Asn Gly 


Lys 


Leu Gly Leu Lys 


Tyr Gly 






195 










200 








205 








Ala 


Pro 


Leu 


His 


Val 


Thr Asp Asp 


Leu 


Asn 


Thr 


Leu Thr 


Val 


Ala 


Thr 




210 










215 










220 








Gly 


Pro Gly Val 


Thr 


He 


Asn 


Asn 


Thr 


Ser 


Leu 


Gin Thr 


Lys 


Val 


Thr 


225 










230 










235 








240 


Gly Ala Leu Gly 


Phe 


Asp 


Ser Gin Gly 


Asn 


Met 


Gin Leu 


Asn 


Val 


Ala 










245 








250 








255 




Gly Gly 


Leu 


Arg 
260 


He Asp 


Ser 


Gin 


Asn 


Arg Arg 


Leu He 


Leu 


Asp 


Val 
















265 








270 






Ser 


Tyr 


Pro 


Phe Asp Ala Gin Asn Gin 


Leu 


Asn 


Leu Arg 


Leu 


Gly Gin 




275 










280 








285 








Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 


Asn 


Leu Asp He 


Asn 


Tyr Asn 


290 










295 










300 








Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Sex- 


Asn 


Asn 


Ser Lys 


Lys 


Leu 


Glu 


305 








310 










315 








320 


Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 


Leu 


Met 


Phe 


Asp Ala Thr 


Ala 


He 










325 










330 








335 




Ala 


He 


Asn 


Ala 


Gly Asp 


Gly 


Leu 


Glu 


Phe 


Gly 


Ser Pro 


Asn 


Ala 


Pro 








340 










345 








350 






Asn 


Thr 


Asn 
355 


Pro 


Leu 


Lys 


Thr 


Lys 
360 


He 


Gly 


His 


Gly Leu 
365 


Glu 


Phe 


Asp 


Ser 


Asn 
370 


Lys 


Ala 


Met 


Val 


Pro 
375 


Lys 


Leu 


Gly 


Thr 


Gly Leu 
380 


Ser 


Phe 


Asp 


Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly Asn 


Lys 


Asn 


Asn Asp 


Lys 


Leu 


Thr 


385 










390 










395 








400 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 


Ala 


Asn 


Cys 


Arg Leu 


Asn 


Ala 


Glu 








405 










410 








415 




Lys 


Asp 


Ala 


Lys 
420 


Leu 


Thr 


Leu 


Val 


Leu 
425 


Thr 


Lys 


Cys Gly 


Ser 
430 


Gin 


He 


Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly Ser Leu Ala 


Pro 


He 






435 










440 








445 








Ser Gly Thr Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 


Arg Phe 


Asp 


Glu 


Asn 




450 










455 










460 


i 






Gly Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 


Glu Tyr Trp 


Asn 


Phe 


465 










470 










475 








480 


Arg Asn Gly Asp 


Leu Thr Glu Gly Thr 


Ala 


Tyr 


Thr Asn 


Ala 


Val 


Gly 










485 










490 








495 




Phe 


Met 


Pro 


Asn 
500 


Leu 


Ser 


Ala 


Tyr 


Pro 
505 


Lys 


Ser 


His Gly 


Lys 
510 


Thr 


Ala 


Lys 


Ser 


Asn 


lie 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 




515 










520 








525 








Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn 


Gly Thr Gin Glu Thr 


Gly Asp 
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530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 " *" 560 

His Asn Tyr lie Asn Glu lie Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 " 575 

Tyr lie Ala Gin Glu 
580 



<210> 57 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F K01S*RGD 
<400> 57 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc ggagccggag cctcaaacat aaacctggaa 300 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 420 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 4 80 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 72 0 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 840 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 114 0 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 12 0 0 
ttgtggacca caccagctcc agaggctaac tgtagactaa atgcagagaa agatgctaaa 12 60 
ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 1320 
gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 13 80 
tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 1440 
agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 
ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 
tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 1620 
gaaacaggtg atcattgtga ttgtcgtggt gattgttttt gtacaaciicc aagtgcatac 1680 
tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 1740 
tcctcttaca ctttttcata cattgcccaa gaataa ~ 1776 

<210> 58 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KO!S*RGD 
<400> 58 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
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20 










25 






30 






Phe 


Val 


Ser 


Pro Asn Gly Phe 


Gin 


Glu 


Ser Pro Pro 


Gly 


Val 


Leu 


Ser 






35 










40 






45 








Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser Asn Gly Met 


Leu 


Ala 


Leu 




50 










55 






60 










Lys 


Met 


Gly Asn Gly Leu Ser 


Leu 


Asp 


Glu Ala Gly Asn 


Leu 


Thr 


Ser 


65 










70 








75 








80 


Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Gly Ala Gly 


Ala 


Ser 


Asn 










85 










90 






95 




lie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr Val Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val Ala Gly Asn 


Thr 


Leu 


Thr 






115 










120 






125 








Met 


Gin 
130 


Ser 


Gin 


Ala 


Pro 


Leu 
135 


Thr 


Val 


His Asp Ser 
140 


Lys 


Leu 


Ser 


He 


Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu Gly Lys 


Leu 


Ala 


Leu 


Gin 


145 








150 








155 








160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser Ser Thr 


Leu 


Thr 


He 


Thr 








165 










170 






175 




Ala 


Ser 


Pro 


Pro 
180 


Leu 


Thr 


Thr 


Ala 


Thr 
185 


Gly Ser Leu 


Gly 


He 
190 


Asp 


Leu 


Lys 


Glu 


Pro 
195 


He 


Tyr 


Thr 


Gin 


Asn 
200 


Gly 


Lys Leu Gly 


Leu 
205 


Lys 


Tyr 


Gly 


Ala 


Pro 


Leu 


His 


Val 


Thr Asp Asp 


Leu 


Asn Thr Leu 


Thr 


Val 


Ala 


Thr 




210 










215 






220 










Gly 


Pro Gly Val 


Thr 


He 


Asn 


Asn 


Thr 


Ser Leu Gin 


Thr 


Lys 


Val 


Thr 


225 










230 








235 








240 


Gly Ala Leu Gly 


Phe 


Asp 


Ser 


Gin 


Gly 


Asn Met Gin 


Leu 


Asn 


Val 


Ala 










245 










250 






255 




Gly Gly Leu Arg 


He 


Asp 


Ser 


Gin 


Asn 


Arg Arg Leu 


He 


Leu 


Asp Val 








260 










265 






270 






Ser 


Tyr 


Pro 


Phe 


Asp Ala 


Gin 


Asn 


Gin 


Leu Asn Leu Arg 


Leu 


Gly Gin 




275 










280 






285 








Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 


Asn Leu Asp 


He 


Asn 


Tyr Asn 


290 










295 






300 










Lys 


Gly Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Ser 


Asn Asn Ser 


Lys 


Lys 


Leu 


Glu 


305 










310 








315 








320 


Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 


Leu 


Met Phe Asp Ala 


Thr 


Ala 


He 










325 






330 






335 




Ala 


He 


Asn 


Ala 


Gly Asp Gly Leu 


Glu 


Phe Gly Ser 


Pro 


Asn 


Ala 


Pro 








340 










345 






350 






Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He 


Gly His Gly Leu 


Glu 


Phe Asp 






355 










360 






365 








Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu 


Gly Thr Gly 


Leu 


Ser 


Phe 


Asp 




370 








375 






380 










Ser 


Thr Gly 


Ala 


He 


Thr 


Val 


Gly 


Asn 


Lys Asn Asn Asp 


Lys 


Leu 


Thr 


385 










390 






395 








400 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 


Ala 


Asn Cys Arg 


Leu 


Asn 


Ala 


Glu 








405 










410 






415 




Lys 


Asp Ala Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr Lys Cys 


Gly 


Ser 


Gin 


He 






420 










425 






430 






Leu 


Ala 


Thr 
435 


Val 


Ser 


Val 


Leu 


Ala 
440 


Val 


Lys Gly Ser 


Leu 
445 


Ala 


Pro 


He 


Ser 


Gly 
450 


Thr 


Val 


Gin 


Ser 


Ala 
455 


His 


Leu 


He He Arg 
460 


Phe 


Asp 


Glu 


Asn 


Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp Pro Glu 


Tyr 


Trp 


Asn 


Phe 


465 










470 








47 5 








480 


Arg Asn Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr 


Ala Tyr Thr 


Asn 


Ala 


Val 


Gly 










485 










490 






495 




Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys Ser His 


Gly 


Lys 


Thr 


Ala 








500 








505 






510 
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Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 




515 










520 


Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 




530 










535 




His 


Cys 


Asp 


Cys 


Arg 


Gly Asp 


Cys 


545 










550 






Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 










565 








He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 








580 









Tyr Leu Asn Gly Asp Lys Thr Lys 
525 

Asn Gly Thr Gin Glu Thr Gly Asp 
540 

Phe Cys Thr Thr Pro Ser Ala Tyr 
555 560 
Ser Gly His Asn Tyr lie Asn Glu 

570 575 
Phe Ser Tyr lie Ala Gin Glu 
585 590 



<210> 59 

<211> 972 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 35F 



<400> 59 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacatacgtg 

aatggattag 

aacaacggtg 

ccacctaact 

ttagtaaaaa 

gtgaaccaaa 

tctggaaatc 

acagcgacca 

cccttcaaca 

actagttatg 

atttcttcca 

ccagaaagca 

gacgacgaat 



gagtccggct 
aacacccctt 
gagttcttac 
aagtgggagg 
ctacagcacc 
aaactcaaaa 
acatttgtat 
gtcaaattgt 
atggagggct 
tgttcacaca 
tattaactga 
gtgaaactgt 
ccactactag 
atagaagtct 
atgttgccta 
acatagctac 
aa 



cagtgactcc 
tataaaccca 
tttaaaatgt 
gggacttaca 
cattactaaa 
caataaacta 
aaaggatagt 
ggaaaacact 
tgttaatggc 
aaagacagca 
ggaatcagac 
agccagcagc 
ggatagtgaa 
atttcccttg 
tgccatacaa 
gctgaccaca 



ttcaaccctg 
gggtttattt 
ttaaccccac 
gtggatgaca 
aataatcact 
tgtgccaaat 
attaacacct 
aatacaaatg 
tacgtgtctc 
aacatccaat 
ttaaaaattc 
aaagccttta 
aactacattc 
aacatttcta 
tttgaatgga 
tccccctttt 



tctaccccta 
ccccaaatgg 
taacaaccac 
ctgatggtac 
ctgtagaact 
tgggaaatgg 
tatggactgg 
atggcaaact 
tagttggtgt 
taagattata 
cacttaaaaa 
tgccaagtac 
atggaatatg 
taatgctaaa 
atctaaatgc 
tcttttctta 



tgaagatgaa 60 
cttcacacaa 120 
aggcggatct 180 
cttacaagaa 240 
atccattgga 3 00 
gttaaaattt 360 
aataaaccct 420 
tactttagta 480 
atcagacact 540 
ttttgactct 600 
taaatcttct 660 
tacagcttat 720 
ttactacatg 780 
cagccgtatg 84 0 
aagtgaatct 900 
cattacagaa 960 
972 



<210> 60 
<211> 323 
<212> PRT 

<213> Artificial Sequence 



<220> 
<223> 35F 



<400> 60 



Met Thr Lys 


Arg 


Val 


Arg 


Leu 


Ser 


Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 




5 








10 










15 




Tyr Glu Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 




20 










25 










30 




He Ser Pro 


Asn 


Gly 


Phe 


Thr 


Gin 


Ser 


Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 


35 










40 










45 








Lys Cys Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr 


Gly 


Gly 


Ser 


Leu 


Gin 


Leu 


Lys 


50 








55 










60 










Val Gly Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp 


Thr 


Asp 


Gly 


Thr 


Leu 


Gin 


Glu 


65 






70 










75 










80 


Asn He Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 






85 










90 










95 




Leu Ser He 


Gly 


Asn 


Gly 


Leu 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 
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100 


TjVS 


Leu 


Glv 


Asn 






115 




Asp 


Ser 


He 


Asn 




130 






Gin 


He 


Val 


Glu 


145 








Leu 


Val 


Lys 


Asn 


Val 


Ser 


Asp 


Thr 








180 


Gin 


Leu. 


Arcx 


Leu 






195 




Ser 


Asp 


Leu 


Lys 




210 






Glu 


Thr 


Val 


Ala 


225 








Pro 


Phe 


Asn 


Thr 


Cys 


Tyr 


Tyr 


Met 








260 


Ser 


He 


Met 


Leu 






275 




He 


Gin 


Phe 


Glu 




290 






He 


Ala 


Thr 


Leu 


305 








Asp 


Asp 


Glu 





Gly 


Leu 


Lys 


Phe 








120 


Thr 


Leu 


Trp 


Thr 






135 




Asn 


Thr 


Asn 


Thr 




150 






Gly 


Gly 


Leu 


Val 


165 








Val 


Asn 


Gin 


Met 


Tyr 


Phe 


Asp 


Ser 








200 


He 


Pro 


Leu 


Lys 






215 




Ser 


Ser 


Lys 


Ala 




230 






Thr 


Thr 


Arg 


Asp 


245 








Thr 


Ser 


Tyr 


Asp 


Asn 


Ser 


Arg 


Met 








280 


Trp 


Asn 


Leu 


Asn 






295 




Thr 


Thr 


Ser 


Pro 



310 



105 




Asn Asn Gly 


Asp 


Gly He Asn 


Pro 




140 


Asn Asp Gly 


Lys 


155 




Asn Gly Tyr 


Val 


170 




Phe Thr Gin 


Lys 


185 




Ser Gly Asn 


Leu 


Asn Lys Ser 


Ser 


220 


Phe Met Pro 


Ser 


235 




Ser Glu Asn 


Tyr 


250 




Arg Ser Leu 


Phe 


265 




He Ser Ser 


Asn 


Ala Ser Glu 


Ser 




300 


Phe Phe Phe 


Ser 


315 





110 

He Cys He Lys 
125 

Pro Pro Asn Cys 

Leu Thr Leu Val 
160 

Ser Leu Val Gly 
175 

Thr Ala Asn He 
190 

Leu Thr Glu Glu 
205 

Thr Ala Thr Ser 

Thr Thr Ala Tyr 
240 

He His Gly He 
255 

Pro Leu Asn He 
270 

Val Ala Tyr Ala 
285 

Pro Glu Ser Asn 

Tyr He Thr Glu 
320 



<210> 61 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 35F RGD 



<400> 61 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacataegtg 

aatggattag 

aacaaeggtg 

ccacctaact 

ttagtaaaaa 

gtgaaccaaa 

tctggaaatc 

acagcgacca 

cccttcaaca 

actagttatg 

atttcttcca 

tttgaatgga 

tccccctttt 

<210> 62 
<211> 333 
<212> PRT 



gagtcegget 
aacacccctt 
gagttcttac 
aagtgggagg 
ctacagcacc 
aaactcaaaa 
acatttgtat 
gtcaaattgt 
atggagggct 
tgttcacaca 
tattaactga 
gtgaaactgt 
ccactactag 
atagaagtct 
atgtacattg 
atetaaatge 
tcttttctta 



cagtgactcc 
tataaaccca 
tttaaaatgt 
gggacttaca 
cattactaaa 
caataaacta 
aaaggatagt 
ggaaaacact 
tgttaatggc 
aaagacagca 
ggaatcagac 
agccagcagc 
ggatagtgaa 
atttcccttg 
tgattgtcgt 
aagtgaatct 
cattacagaa 



ttcaaccctg 
gggtttattt 
ttaaccccac 
gtggatgaca 
aataatcact 
tgtgccaaat 
attaacacct 
aatacaaatg 
tacgtgtctc 
aacatccaat 
ttaaaaattc 
aaagecttta 
aactacattc 
aacatttcta 
ggtgattgtt 
ccagaaagca 
gacgacgaat 



tctaccccta 
ccccaaatgg 
taacaaccac 
ctgatggtac 
ctgtagaact 
tgggaaatgg 
tatggactgg 
atggcaaact 
tagttggtgt 
taagattata 
cacttaaaaa 
tgecaagtae 
atggaatatg 
taatgctaaa 
tttgegcata 
acatagctac 
aa 



tgaagatgaa 
cttcacacaa 
aggeggatet 
cttacaagaa 
atccattgga 
gttaaaattt 
aataaaccct 
tactttagta 
atcagacact 
ttttgactct 
taaatcttct 
tacagcttat 
ttactacatg 
cagcegtatg 
tgccatacaa 
gctgaccaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1002 
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<213> Artificial Sequence 
<220> 

<223> 35F RGD 
<4O0> 62 

Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 

20 25 30 

lie Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 

35 40 45 

Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 

50 55 60 

Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
55 70 75 ' 80 

Asn lie Arg Ala Thr Ala Pro lie Thr Lys Asn Asn His Ser Val Glu 

85 90 95 

Leu Ser lie Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 

100 105 110 

Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp lie Cys lie Lys 

115 120 125 

Asp Ser lie Asn Thr Leu Trp Thr Gly lie Asn Pro Pro Pro Asn Cys 

130 135 140 

Gin lie Val Glu Asn Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val 
145 150 155 160 

Leu Val Lys Asn Gly Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly 

165 170 " 175 

Val Ser Asp Thr Val Asn Gin Met Phe Thr Gin Lys Thr Ala Asn lie 

180 185 190 

Gin Leu Arg Leu Tyr Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu 

195 200 205 

Ser Asp Leu Lys He Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser 

210 215 220 

Glu Thr Val Ala Ser Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr 
225 230 235 240 

Pro Phe Asn Thr Thr Thr Arg Asp Ser Glu Asn Tyr lie His Gly He 

245 250 255 

Cys Tyr Tyr Met Thr Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He 

260 265 270 

Ser He Met Leu Asn Ser Arg Met He Ser Ser Asn Val His Cys Asp 

275 280 285 

Cys Arg Gly Asp Cys Phe Cys Ala Tyr Ala He Gin Phe Glu Trp Asn 

290 295 300 

Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn He Ala Thr Leu Thr Thr 
305 310 315 320 

Ser Pro Phe Phe Phe Ser Tyr He Thr Glu Asp Asp Glu 
325 330 



<210> 63 
<211> 1164 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF 
<400> 63 

atgaaaagaa ccagaattga agacgacttc aaccccgtct acccctatga caccttctca 60 
actcccagca tcccctatgt agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 120 
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cccccaggag ttttagcact caagtacact gaccccatta ctaccaatgc taagcatgag 180 
cttactttaa aacttggaag caacataact ttagaaaatg ggttactttc ggccacagtt 240 
cccactgttt ctcctcccct tacaaacagt aacaactccc tgggtttagc cacatccgct 300 
cccatagctg tatcagctaa ctctctcaca ttggccaccg ccgcaccact gacagtaagc 360 
aacaaccagc ttagtattaa cgcgggcaga ggtttagtta taactaacaa tgccttaaca 420 
gttaatccta ccggagcgct aggtttcaat aacacaggag ctttacaatt aaatgctgca 480 
ggaggaatga gagtggacgg tgccaactta attcttcatg tagcatatcc ctttgaagca 540 
atcaaccagc taacactgcg attagaaaac gggttagaag taaccagcgg aggaaagctt 600 
aacgttaagt tgggatcagg cctccaattt gacagtaacg gacgcattgc tattagtaat 660 
agcaaccgaa ctcgaagtgt accatccctc actaccattt ggtctatctc gcctacgcct 720 
aactgctcca tttatgaaac ccaagatgca aacctatttc tttgtctaac taaaaacgga 780 
gctcacgtat taggtactat aacaatcaaa ggtcttaaag gagcactgcg ggaaatgcac 840 
gataacgctc tatctttaaa acttcccttt gacaatcagg gaaatttact taactgtgcc 900 
ttggaatcat ccacctggcg ttaccaggaa accaacgcag tggcctctaa tgccttaaca 960 
tttatgccca acagtacagt gtatccacga aacaaaaccg ctcacccggg caacatgctc 102 0 
atccaaatct cgcctaacat caccttcagt gtcgtctaca acgagataaa cagtgggtat 1080 
gcttttactt ttaaatggtc agccgaaccg ggaaaacctt ttcacccacc taccgctgta 1140 
ttttgctaca taactgaaga ataa ~ ~ 1164 

<210> 64 
<211> 387 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 41sF 



<400> 64 



Met 


Lys Arg 


Thr 


Arg 


He 


Glu 


Asp 


Asp 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


Tyr 


1 






5 










10 










15 


Asp 


Thr Phe 


Ser 


Thr 


Pro 


Ser 


He 


Pro 


Tyr 


Val 


Ala 


Pro 


Pro 


Phe 


Val 






20 










25 








30 






Ser 


Ser Asp 


Gly 


Leu 


Gin 


Glu 


Lys 


Pro 


Pro 


Gly 


Val 


Leu 


Ala 


Leu 


Lys 




35 










40 










45 






Tyr 


Thr Asp 


Pro 


He 


Thr 


Thr 


Asn 


Ala 


Lys 


His 


Glu 


Leu 


Thr 


Leu 


Lys 




50 








55 










60 








Leu 


Gly Ser 


Asn 


He 


Thr 


Leu 


Glu 


Asn 


Gly 


Leu 


Leu 


Ser 


Ala 


Thr 


Val 


65 








70 








75 










80 


Pro 


Thr Val 


Ser 


Pro 
85 


Pro 


Leu 


Thr 


Asn 


Ser 
90 


Asn 


Asn 


Ser 


Leu 


Gly 
95 


Leu 


Ala 


Thr Ser 


Ala 
100 


Pro 


He 


Ala 


Val 


Ser 
105 


Ala 


Asn 


Ser 


Leu 


Thr 
110 


Leu 


Ala 


Thr 


Ala Ala 
115 


Pro 


Leu 


Thr 


Val 


Ser 
120 


Asn 


Asn 


Gin 


Leu 


Ser 
125 


He 


Asn 


Ala 


Gly 


Arg Gly 
130 


Leu 


Val 


He 


Thr 
135 


Asn 


Asn 


Ala 


Leu 


Thr 
14 0 


Val 


Asn 


Pro 


Thr 


Gly 


Ala Leu 


Gly 


Phe 


Asn 


Asn 


Thr 


Gly 


Ala 


Leu 


Gin 


Leu 


Asn 


Ala 


Ala 


145 








150 










155 










160 


Gly 


Gly Met 


Arg 


Val 


Asp 


Gly 


Ala 


Asn 


Leu 


He 


Leu 


His 


Val 


Ala 


Tyr 








165 










170 










175 


Pro 


Phe Glu 


Ala 


lie 


Asn 


Gin 


Leu 


Thr 


Leu 


Arg 


Leu 


Glu 


Asn 


Gly 


Leu 






180 










185 








190 




Glu 


Val Thr 


Ser 


Gly 


Gly 


Lys 


Leu 


Asn 


Val 


Lys 


Leu 


Gly 


Ser 


Gly 


Leu 




195 










200 










205 






Gin 


Phe Asp 


Ser 


Asn 


Gly 


Arg 


He 


Ala 


He 


Ser 


Asn 


Ser 


Asn 


Arg 


Thr 




210 








215 










220 








Arg 


Ser Val 


Pro 


Ser 


Leu 


Thr 


Thr 


He 


Trp 


Ser 


He 


Ser 


Pro 


Thr 


Pro 


225 








230 








235 










240 


Asn 


Cys Ser 


He 


Tyr 
245 


Glu 


Thr 


Gin 


Asp 


Ala 
250 


Asn 


Leu 


Phe 


Leu 


Cys 
255 


Leu 


Thr 


Lys Asn 


Gly 


Ala 


His 


Val 


Leu 


Gly 


Thr 


He 


Thr 


He 


Lys 


Gly 


Leu 
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260 

Lys Gly Ala Leu Arg 
275 

Pro Phe Asp Asn Gin 
290 

Thr Trp Arg Tyr Gin 
305 

Phe Met Pro Asn Ser 
325 

Gly Asn Met Leu lie 
340 

Tyr Asn Glu He Asn 
355 

Glu Pro Gly Lys Pro 

370 
Thr Glu Glu 
385 



265 

Glu Met His Asp Asn Ala 
280 

Gly Asn Leu Leu Asn Cys 
295 

Glu Thr Asn Ala Val Ala 
310 315 
Thr Val Tyr Pro Arg Asn 
330 

Gin He Ser Pro Asn He 
345 

Ser Gly Tyr Ala Phe Thr 
360 

Phe His Pro Pro Thr Ala 
375 







270 






Leu 


Ser 


Leu 


Lys 


Leu 




285 






Ala 


Leu 


Glu 


Ser 


Ser 


300 










Ser 


Asn 


Ala 


Leu 


Thr 










320 


Lys 


Thr 


Ala 


His 


Pro 








335 




Thr 


Phe 


Ser 


Val 


Val 






350 






Phe 


Lys 


Trp 


Ser 


Ala 




365 








Val 


Phe 


Cys 


Tyr 


He 


380 











<210> 65 
<211> 1194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 
<400> 65 

atgaaaagaa ccagaattga agacgacttc aaccccgtct acccctatga caccttctca 60 
actcccagca tcccctatgt agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 12 0 
cccccaggag ttttagcact caagtacact gaccccatta ctaccaatgc taagcatgag 180 
cttactttaa aacttggaag caacataact ttagaaaatg ggttactttc ggccacagtt 24 0 
cccactgttt ctcctcccct tacaaacagt aacaactccc tgggtttagc cacatccgct 300 
cccatagctg tatcagctaa ctctctcaca ttggccaccg ccgcaccact gacagtaagc 360 
aacaaccagc ttagtattaa cgcgggcaga ggtttagtta taactaacaa tgccttaaca 420 
gttaatccta ccggagcgct aggtttcaat aacacaggag ctttacaatt aaatgctgca 4 80 
ggaggaatga gagtggacgg tgccaactta attcttcatg tagcatatcc ctttgaagca 540 
atcaaccagc taacactgcg attagaaaac gggttagaag taaccagcgg aggaaagctt 600 
aacgttaagt tgggatcagg cctccaattt gacagtaacg gacgcattgc tattagtaat 660 
agcaaccgaa ctcgaagtgt accatccctc actaccattt ggtctatctc gcctacgcct 720 
aactgctcca tttatgaaac ccaagatgca aacctatttc tttgtctaac taaaaacgga 780 
gctcacgtat taggtactat aacaatcaaa ggtcttaaag gagcactgcg ggaaatgcac 840 
gataacgctc tatctttaaa acttcccttt gacaatcagg gaaatttact taactgtgcc 900 
ttggaatcat ccacctggcg ttaccaggaa accaacgcag tggcctctaa tgccttaaca 960 
tttatgccca acagtacagt gtatccacga aacaaaaccg ctcacccggg caacatgctc 1020 
atccaaatct cgcctaacat caccttcagt gtcgtctaca acgagataaa ctgtgattgt 10 80 
cgtggtgatt gtttttgtac tagtgggtat gcttttactt ttaaatggtc agccgaaccg 1140 
ggaaaacctt ttcacccacc taccgctgta ttttgctaca taactgaaga ataa 1194 

<210> 66 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 
<400> 66 

Met Lys Arg Thr Arg He Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 

15 10 15 

Asp Thr Phe Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 
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20 



Ser Ser Asp 


Gly Leu Gin Glu 


Lys 


35 










40 


Tyr Thr Asp 


Pro 


He 


Thr 


Thr 


Asn 


50 








55 




Leu Gly Ser 


Asn 


He 


Thr 


Leu 


Glu 


65 






70 






Pro Thr Val 


Ser 


Pro 
85 


Pro 


Leu 


Thr 


Ala Thr Ser 


Ala 
100 


Pro 


He 


Ala 


Val 


Thr Ala Ala 


Pro 


Leu 


Thr 


Val 


Ser 


115 










120 


Gly Arg Gly 


Leu 


Val 


He 


Thr 


Asn 


130 








135 




Gly Ala Leu 


Gly 


Phe 


Asn 


Asn 


Thr 


145 






150 






Gly Gly Met 


Arg Val 


Asp Gly 


Ala 






165 








Pro Phe Glu 


Ala 
180 


He 


Asn 


Gin 


Leu 


Glu Val Thr 


Ser 


Gly Gly Lys 


Leu 


195 










200 


Gin Phe Asp 


Ser Asn Gly Arg 


He 


210 








215 




Arg Ser Val 


Pro 


Ser 


Leu 


Thr 


Thr 


225 






230 






Asn Cys Ser 


He 


Tyr 
245 


Glu 


Thr 


Gin 


Thr Lys Asn 


Gly Ala 


His 


Val 


Leu 




260 










Lys Gly Ala 


Leu 


Arg 


Glu 


Met 


'His 


275 










280 


Pro Phe Asp 


Asn 


Gin Gly Asn 


Leu 


290 








295 




Thr Trp Arg 


Tyr 


Gin 


Glu 


Thr 


Asn 


305 






310 






Phe Met Pro 


Asn 


Ser 


Thr 


Val 


Tyr 






325 






Gly Asn Met 


Leu 
340 


He 


Gin 


He 


Ser 


Tyr Asn Glu 


He 


Asn 


Cys 


Asp 


Cys 


355 










360 


Gly Tyr Ala 


Phe 


Thr 


Phe 


Lys 


Trp 


370 








375 




His Pro Pro 


Thr 


Ala 


Val 


Phe 


Cys 


385 






390 





25 






30 






Pro Pro Gly Val 


Leu 


Ala 


Leu 


Lys 






45 






Ala Lys His 


Glu 


Leu 


Thr 


Leu 


Lys 




60 








Asn Gly Leu 


Leu 


Ser 


Ala 


Thr 


Val 


75 










80 


Asn Ser Asn 


Asn 


Ser 


Leu 


Gly Leu 


90 








95 




Ser Ala Asn 


Ser 


Leu 


Thr 


Leu 


Ala 


105 






110 






Asn Asn Gin 


Leu 


Ser 
125 


He 


Asn 


Ala 


Asn Ala Leu 


Thr 
140 


Val 


Asn 


Pro 


Thr 


Gly Ala Leu Gin 


Leu 


Asn 


Ala 


Ala 


155 










160 


Asn Leu He 


Leu 


His 


Val 


Ala 


Tyr 


170 








175 


Thr Leu Arg 


Leu 


Glu 


Asn 


Gly Leu 


185 






190 






Asn Val Lys 


Leu 


Gly Ser 


Gly Leu 






205 








Ala He Ser 


Asn 


Ser 


Asn Arg 


Thr 




220 










He Trp Ser 


He 


Ser 


Pro 


Thr 


Pro 


235 










240 


Asp Ala Asn Leu 


Phe 


Leu 


Cys 


Leu 


250 








255 




Gly Thr He 


Thr 


He 


Lys 


Gly 'Leu 


265 






270 






Asp Asn Ala 


Leu 


Ser 
285 


Leu 


Lys 


LeU 


Leu Asn Cys 


Ala 
300 


Leu 


Glu 


Ser 


Ser 


Ala Val Ala 


Ser 


Asn 


Ala 


Leu 


Thr 


315 










320 


Pro Arg Asn Lys 


Thr 


Ala 


His 


Pro 


330 








335 




Pro Asn He 


Thr 


Phe 


Ser 


Val 


Val 


345 






350 






Arg Gly Asp Cys 


Phe 


Cys 


Thr 


Ser 






365 








Ser Ala Glu 


Pro 


Gly Lys 


Pro 


Phe 




380 










Tyr He Thr 


Glu 


Glu 









395 



<210> 67 
<211> 1737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5 PD1 pent on 
<400> 67 

atgcggcgcg cggcgatgta tgaggaaggt 
gcggcgccag tggcggcggc gctgggttct 
gtgcctccgc ggtacctgcg gcctaccggg 



cctcctccct cctacgagag tgtggtgagc 60 
cccttcgatg ctcccctgga cccgccgttt 120 
gggagaaaca gcatccgtta ctctgagttg 180 
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gcacccctat tcgacaccac ccgtgtgtac ctggtggaca acaagtcaac ggatgtggca 240 
tccctgaact accagaacga ccacagcaac tttctgacca cggtcattca aaacaatgac 300 
tacagcccgg gggaggcaag cacacagacc atcaatcttg acgaccggtc gcactggggc 360 
ggcgacctga aaaccatcct gcataccaac atgccaaatg tgaacgagtt catgtttacc 420 
aataagttta aggcgcgggt gatggtgtcg cgcttgccta ctaaggacaa tcaggtggag 4 80 
ctgaaatacg agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc 54 0 
atagacctta tgaacaacgc gatcgtggag cactacttga aagtgggcag acagaacggg 60 0 
gttctggaaa gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac 660 
cccgtcactg gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc 720 
attttgctgc caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc 780 
atccgcaagc ggcaaccctt cqaggagggc tttaggatca cctacgatga tctggagggt 840 
ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa agatgacacc 900 
gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc ggaagagaac 960 
tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatag ccgcggctac 1020 
ccctacgacg tgcccgacta cgcgggcacc agcgccacac gggctgagga gaagcgcgct 1080 
gaggccgaag cagcggccga agctgccgcc cccgctgcgc aacccgaggt cgagaagcct 1140 
cagaagaaac cggtgatcaa acccctgaca gaggacagca agaaacgcag ttacaaccta 1200 
ataagcaatg acagcacctt cacccagtac cgcagctggt accttgcata caactacggc 1260 
gaccctcaga ccggaatccg ctcatggacc ctgctttgca ctcctgacgt aacctgcggc 1320 
tcggagcagg tctactggtc gttgccagac atgatgcaag accccgtgac cttccgctcc 1380 
acgcgccaga tcagcaactt tccggtggtg ggcgccgagc tgttgcccgt gcactccaag 1440 
agcttctaca acgaccaggc cgtctactcc caactcatcc gccagtttac ctctctgacc 1500 
cacgtgttca atcgctttcc cgagaaccag attttggcgc gcccgccagc ccccaccatc 1560 
accaccgtca gtgaaaacgt tcctgctctc acagatcacg ggacgctacc gctgcgcaac 162 0 
agcatcggag gagtccagcg agtgaccatt actgacgcca gacgccgcac ctgcccctac 1680 
gtttacaagg ccctgggcat agtctcgccg cgcgtcctat cgagccgcac tttttga 1737 

<210> 68 
<211> 578 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5 PD1 pent on 
<400> 68 



Met 


Arg 


Arg 


Ala 


Ala 


Met 


Tyr 


Glu Glu Gly Pro Pro 


Pro 


Ser 


Tyr 


Glu 


1 


Val 






5 








10 










15 




Ser 


Val 


Ser 


Ala 


Ala 


Pro 


Val Ala 


Ala 


Ala 


Leu 


Gly 


Ser 


Pro 


Phe 








20 








25 








30 






Asp 


Ala 


Pro 


Leu 


Asp 


Pro 


Pro 


Phe Val 


Pro 


Pro 


Arg 


Tyr 


Leu 


Arg 


Pro 


Thr 




35 










40 






45 






Gly 


Gly 


Arg 


Asn 


Ser 


lie 


Arg Tyr 


Ser 


Glu 


Leu 


Ala 


Pro 


Leu 


Phe 




50 










55 






60 










Asp 


Thr 


Thr 


Arg 


Val 


Tyr 


Leu 


Val Asp 


Asn 


Lys 


Ser 


Thr 


Asp 


Val 


Ala 


65 










70 








75 








80 


Ser 


Leu 


Asn 


Tyr 


Gin 


Asn 


Asp 


His Ser 


Asn 


Phe 


Leu 


Thr 


Thr 


Val 


He 


Gin 








85 








90 










95 




Asn 


Asn 


Asp 
100 


Tyr 


Ser 


Pro 


Gly Glu 
105 


Ala 


Ser 


Thr 


Gin 


Thr 
110 


He 


Asn 


Leu 


Asp 


Asp 


Arg 


Ser 


His 


Trp 


Gly Gly Asp Leu Lys 


Thr 


He 


Leu 


His 


Thr 




115 










12 0 








125 








Asn 


Met 


Pro 


Asn 


Val 


Asn 


Glu Phe 


Met 


Phe 


Thr 


Asn 


Lys 


Phe 


Lys 


Ala 


130 










135 








140 






Arg 


Val 


Met 


Val 


Ser 


Arg 


Leu Pro 


Thr 


Lys 


Asp 


Asn 


Gin 


Val 


Glu 


145 










150 








155 








160 


Leu 


Lys 


Tyr 


Glu 


Trp 


Val 


Glu 


Phe Thr 


Leu 


Pro 


Glu 


Gly 


Asn 


Tyr 


Ser 


Glu 








165 








170 








175 




Thr 


Met 


Thr 


lie 


Asp 


Leu 


Met Asn 


Asn 


Ala 


He 


Val 


Glu 


His 


Tyr 








180 








185 










190 




Leu 


Lys 


Val 


Gly 


Arg 


Gin 


Asn 


Gly Val 


Leu 


Glu 


Ser 


Asp 


He 


Gly 


Val 
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195 200 205 

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Phe Asp Pro Val Thr Gly 

210 215 " 220 

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He 
225 230 235 240 

He Leu Leu Pro Gly Cys Gly Val Asp Phe Thr His Ser Arg Leu Ser 

245 250 255 

Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Arg 

260 265 270 

He Thr Tyr Asp Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp 

275 280 285 

Val Asp Ala Tyr Gin Ala Ser Leu Lys Asp Asp Thr Glu Gin Gly Gly 

290 295 300 

Gly Gly Ala Gly Gly Ser Asn Ser Ser Gly Ser Gly Ala Glu Glu Asn 
305 310 315 320 

Ser Asn Ala Ala Ala Ala Ala Met Gin Pro Val Glu Asp Met Asn Asp 

325 330 * 335 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser Ala 

340 345 350 

Thr Arg Ala Glu Glu Lys Arg Ala Glu Ala Glu Ala Ala Ala Glu Ala 

355 360 365 

Ala Ala Pro Ala Ala Gin Pro Glu Val Glu Lys Pro Gin Lys Lys Pro 

370 375 380 

Val He Lys Pro Leu Thr Glu Asp Ser Lys Lys Arg Ser Tyr Asn Leu 
385 390 395 ' 400 

He Ser Asn Asp Ser Thr Phe Thr Gin Tyr Arg Ser Trp Tyr Leu Ala 

405 410 ~ 415 

Tyr Asn Tyr Gly Asp Pro Gin Thr Gly He Arg Ser Trp Thr Leu Leu 

420 425 430 

Cys Thr Pro Asp Val Thr Cys Gly Ser Glu Gin Val Tyr Trp Ser Leu 

435 440 445 

Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin He 

450 455 460 

Ser Asn Phe Pro Val Val Gly Ala Glu Leu Leu Pro Val His Ser Lys 
465 470 475 480 

Ser Phe Tyr Asn Asp Gin Ala Val Tyr Ser Gin Leu He Arg Gin Phe 

485 490 495 

Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu 

500 505 510 

Ala Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro 

515 520 525 

Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Asn Ser He Gly Gly 

530 535 540 

Val Gin Arg Val Thr He Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 
545 550 555 " " 560 

Val Tyr Lys Ala Leu Gly lie Val Ser Pro Arg Val Leu Ser Ser Arg 
565 570 575 

Thr Phe 



<210> 69 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 
<400> 69 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
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accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 3 00 • 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 42 0 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 4 80 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 720 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 84 0 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 1200 
ttgtggaccg gaataaaccc tccacctaac tgtcaaattg tggaaaacac taatacaaat 12 60 
gatggcaaac ttactttagt attagtaaaa aatggagggc ttgttaatgg ctacgtgtct 132 0 
ctagttggtg tatcagacac tgtgaaccaa atgttcacac aaaagacagc aaacatccaa 13 80 
ttaagattat attttgactc ttctggaaat ctattaactg aggaatcaga cttaaaaatt 144 0 
ccacttaaaa ataaatcttc tacagcgacc agtgaaactg tagccagcag caaagccttt 15 0 0 
atgccaagta ctacagctta tcccttcaac accactacta gggatagtga aaactacatt 1560- 
catggaatat gttactacat gactagttat gatagaagtc tatttccctt gaacatttct 162 0- 
ataatgctaa acagccgtat gatttcttcc aatgttgcct atgccataca atttgaatgg 1680. 
aatctaaatg caagtgaatc tccagaaagc aacatagcta cgctgaccac atcccccttt 1740. 
ttcttttctt acattacaga agacgacgaa taa 1773 

<210> 70 
<211> 590 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 



<400> 70 



Met 


Lys 


Arg Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr Phe Asn Pro 


Val 


Tyr 


Pro 


1 






5 










10 




15 




Tyr 


Asp 


Thr Glu 
20 


Thr 


Gly 


Pro 


Pro 


Thr 
25 


Val Pro Phe Leu 


Thr 
30 


Pro 


Pro 


Phe 


Val 


Ser Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser Pro Pro Gly Val 


Leu 


Ser 






35 








40 




45 








Leu 


Arg 
50 


Leu Ser 


Glu 


Pro 


Leu 
55 


Val 


Thr 


Ser Asn Gly Met 
60 


Leu 


Ala 


Leu 


Lys 


Met 


Gly Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu Ala Gly Asn Leu Thr 


Ser 


65 








70 








75 






80 


Gin 


Asn 


Val Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Lys Lys Thr 


Lys 


Ser 


Asn 


lie 






85 










90 


95 




Asn 


Leu Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr Val Thr Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val Ala Gly Asn Thr Leu Thr 






115 








120 




125 








Met 


Gin 
130 


Ser Gin 


Ala 


Pro 


Leu 
135 


Thr 


Val 


His Asp Ser Lys 
140 


Leu 


Ser 


He 


Ala 


Thr Gin Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu Gly Lys Leu 


Ala 


Leu 


Gin 


145 








150 








155 






160 


Thr 


Ser 


Gly Pro 


Leu 
165 


Thr 


Thr 


Thr 


Asp 


Ser Ser Thr Leu 
170 


Thr 


He 
175 


Thr 
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Ala 


Ser 


Pro 


Pro 


Leu 


Thr Thr 


Ala 


Thr Gly Ser Leu Gly He 


Asp 


Leu 








180 








185 






190 




Lys 


Glu 


Pro 


He Tyr Thr Gin 


Asn 


Gly Lys Leu Gly Leu Lys 


Tyr Gly 






195 








200 






205 








Ala 


Pro 


Leu 


His 


Val 


Thr Asp 


Asp 


Leu Asn Thr 


Leu 


Thr 


Val 


Ala 


Thr 




210 








215 




220 










Gly 


Pro 


Gly 


Val 


Thr 


He Asn 


Asn 


Thr Ser Leu 


Gin 


Thr 


Lys 


Val 


Thr 


22S 










230 




235 








240 


Gly 


Ala 


Leu 


Gly Phe Asp Ser 


Gin Gly Asn Met 


Gin 


Leu 


Asn 


Val 


Ala 










245 






250 






i 


255 




Gly 


Gly 


Leu 


Arg 


He 


Asp Ser 


Gin Asn Arg Arg Leu 


He 


Leu 


Asp 


Val 








260 








265 






270 




Ser 


Tyr 


Pro 


Phe 


Asp 


Ala Gin 


Asn 


Gin Leu Asn 


Leu 


Arg 


Leu 


Gly 


Gin 






275 








280 






285 






Gly 


Pro 


Leu 


Phe 


He 


Asn Ser 


Ala 


His Asn Leu Asp 


He 


Asn 


Tyr 


Asn 




290 








295 






300 








Lys 


Gly 


Leu 


Tyr 


Leu 


Phe Thr 


Ala 


Ser Asn Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


one 










310 




315 






320 


Val 


Asn 


Leu 


Ser 


Thr 


Ala Lys 


Gly 


Leu Met Phe 


Asp 


Ala 


Thr 


Ala 


He 










325 






330 






335 




Ala 


He 


Asn 


Ala Gly Asp Gly 


Leu 


Glu Phe Gly 


Ser 


Pro 


Asn 


Ala 


Pro 








340 








345 






350 






Asn 


Thr 


Asn 


Pro 


Leu 


Lys Thr 


Lys 


He Gly His 


Gly Leu Glu 


Phe 


Asp 






355 








360 






365 






Ser 


Asn 


Lys 


Ala 


Met 


Val Pro 


Lys 


Leu Gly Thr Gly Leu Ser 


Phe Asp 




370 








375 






380 










Ser 


Thr 


Gly 


Ala 


He 


Thr Val 


Gly Asn Lys Asn Asn Asp Lys 


Leu 


Thr 


385 










390 




395 










400 


Leu 


Trp 


Thr 


Gly He Asn Pro 


Pro 


Pro Asn Cys 


Gin 


He 


Val 


Glu 


Asn 










405 






410 








415 




Thr 


Asn 


Thr 


Asn Asp Gly Lys 


Leu 


Thr Leu Val 


Leu 


Val 


Lys 


Asn Gly 








420 








425 






430 






Gly 


Leu 


Val 


Asn Gly Tyr Val 


Ser 


Leu Val Gly Val 


Ser 


Asp 


Thr 


Val 






435 








440 






445 






Asn 


Gin 


Met 


Phe 


Thr 


Gin Lys 


Thr 


Ala Asn He 


Gin 


Leu Arg 


Leu Tyr 




450 








455 






460 










Phe 


Asp 


Ser 


Ser Gly Asn Leu 


Leu 


Thr Glu Glu 


Ser 


Asp 


Leu 


Lys 


He 


*x O -3 










470 




475 






480 


Pro 


Leu 


Lys 


Asn 


Lys 
485 


Ser Ser 


Thr 


Ala Thr Ser 
490 


Glu 


Thr 


Val 


Ala 
495 


Ser 


Ser 


Lys 


Ala 


Phe 
500 


Met 


Pro Ser 


Thr 


Thr Ala Tyr 
505 


Pro 


Phe 


Asn 
510 


Thr 


Thr 


Thr 


Arg 


Asp 


Ser 


Glu 


Asn Tyr 


He 


His Gly He 


Cys 


Tyr 


Tyr 


Met 


Thr 






515 








520 




525 






Ser 


Tyr 
530 


Asp 


Arg 


Ser 


Leu Phe 
535 


Pro 


Leu Asn He 


Ser 
540 


He 


Met 


Leu 


Asn 


Ser 


Arg 


Met 


He 


Ser 


Ser Asn 


Val 


Ala Tyr Ala 


He 


Gin 


Phe 


Glu 


trp 


545 










550 




555 










560 


Asn 


Leu 


Asn 


Ala 


Ser 
565 


Glu Ser 


Pro 


Glu Ser Asn 
570 


He 


Ala 


Thr 


Leu 
575 


Thr 


Thr 


Ser 


Pro 


Phe 


Phe 


Phe Ser 


Tyr 


He Thr Glu Asp Asp Glu 







580 585 590 



<210> 71 
<211> 945 
<212> DNA 
<213> Artificial 

<220> 

<223> 35TS5H 



Sequence 
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<400> 71 

atgaccaaga gagtccggct cagtgactcc 
agcacctccc aacacccctt tataaaccca 
agcccagacg gagttcttac tttaaaatgt 
ctacagctaa aagtgggagg gggacttaca 
aacatacgtg ctacagcacc cattactaaa 
aatggattag aaactcaaaa caataaacta 
aacaacggtg acatttgtat aaaggatagt 
tctcctaact gtagactaaa tgcagagaaa 
tgtggcagtc aaatacttgc tacagtttca 
atatctggaa cagttcaaag tgctcatctt 
ctaaacaatt ccttcctgga cccagaatat 
ggcacagcct atacaaacgc tgttggattt 
cacggtaaaa ctgccaaaag taacattgtc 
aaacctgtaa cactaaccat tacactaaac 
agtgcatact ctatgtcatt ttcatgggac 
tttgccacat cctcttacac tttttcatac 

<210> 72 
<211> 314 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 35TS5H 



ttcaaccctg tctaccccta tgaagatgaa 60 
gggtttattt ccccaaatgg cttcacacaa 120 
ttaaccccac taacaaccac aggcggatct 180 
gtggatgaca ctgatggtac cttacaagaa 240 
aataatcact ctgtagaact atccattgga 300 
tgtgccaaat tgggaaatgg gttaaaattt 360 
attaacacct tatggactac accagctcca 420 
gatgctaaac tcactttggt cttaacaaaa 480 
gttttggctg ttaaaggcag tttggctcca 540 
attataagat ttgacgaaaa tggagtgcta 600 
tggaacttta gaaatggaga tcttactgaa 66 0 
atgcctaacc tatcagctta tccaaaatct 720 
agtcaagttt acttaaacgg agacaaaact 780 
ggtacacagg aaacaggaga cacaactcca 84 0 
tggtctggcc acaactacat taatgaaata 900 
attgcccaag aataa 945 



<400> 72 

Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 

20 25 30 

lie Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 

35 40 45 

Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 

50 55 ~ 60 

Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 . 70 75 80 

Asn lie Arg Ala Thr Ala Pro He Thr Lys Asn Asn His Ser Val Glu 

85 90 95 

Leu Ser He Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 

100 105 110 

Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp He Cys He Lys 

115 120 125 

Asp Ser He Asn Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys 

130 135 140 

Arg Leu Asn Ala Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys 
I 45 150 155 160 

Cys Gly Ser Gin He Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly 

165 170 175 

Ser Leu Ala Pro He Ser Gly Thr Val Gin Ser Ala His Leu He He 

180 185 190 

Arg Phe Asp Glu Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro 

195 200 205 

Glu Tyr Trp Asn Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr 

210 215 220 

Thr Asn Ala Val Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser 
225 230 235 " 240 

Hxs Gly Lys Thr Ala Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn 

245 250 255 

Gly Asp Lys Thr Lys Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr 
260 265 270 
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Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser 

275 280 285 

Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu He Phe Ala Thr Ser 

290 295 300 

Ser Tyr Thr Phe Ser Tyr He Ala Gin Glu 
305 310 
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